

AD-A229  993 


Progress  in  Characterizing  Strictly  Unidimensional 

IRT  Representations 


Brian W. Junker¹

Department  of  Statistics 
University  of  Illinois  at  Champaign-Urbana 

November  29,  1990 




Prepared for the Cognitive Science Research Program, Cognitive and Neural Sciences Division,
Office of Naval Research, under contract number N00014-90-J-1984, 4421-560-01. Approved for
public release; distribution unlimited. Reproduction in whole or in part is permitted for any purpose
of the United States Government.


¹Current address: Department of Statistics, Carnegie Mellon University, Pittsburgh, PA 15213.


REPORT DOCUMENTATION PAGE

Form Approved, OMB No. 0704-0188

1a. Report security classification: Unclassified
3. Distribution/availability of report: Approved for public release; distribution unlimited.
6c. Address (city, state, and ZIP code): 101 Illini Hall, 725 S. Wright Street, Champaign, IL 61820
7a. Name of monitoring organization: Cognitive Science Program, Office of Naval Research (Code 1142CS)
7b. Address (city, state, and ZIP code): 800 N. Quincy Street, Arlington, VA 22217-5000
9. Procurement instrument identification number: N00014-90-J-1984
10. Source of funding numbers: program element no. 61153N; project no. RR04204; work unit accession no. 4421-560
11. Title (include security classification): Progress in characterizing strictly unidimensional IRT representations.
12. Personal author(s): Brian W. Junker (Dept. of Statistics, Carnegie Mellon University)
13a. Type of report: Final Report
13b. Time covered: from Apr 1990 to Sep 1990
14. Date of report (year, month, day): 1990 November 29
15. Page count: 35
16. Supplementary notation: To be submitted for publication in Annals of Statistics.
18. Subject terms: See reverse.
19. Abstract: See reverse.
20. Distribution/availability of abstract: Unclassified/Unlimited
22a. Name of responsible individual: Dr. Charles E. Davis
22b. Telephone (include area code): (703) 696-4046
22c. Office symbol: ONR-1142-CS

DD Form 1473, JUN 86. Previous editions are obsolete. S/N 0102-LF-014-6603


Abstract 


Considerable attention has been paid to the development of nonparametric conditions
on $P[\underline{X}_J = \underline{x}_J]$ that characterize a $d_L = 1$ (locally independent, monotone,
unidimensional) latent variable representation for the binary items $\underline{X}_J = (X_1, \ldots, X_J)$.
Holland and Rosenbaum (Holland 1981; Rosenbaum 1984; Holland and Rosenbaum 1986) focus on
the conditional association (CA) of subtest scores under strict unidimensionality, and Stout
(1987, 1990) treats the larger class of essentially unidimensional ($d_E = 1$) models, focusing
on consistent estimation of $\Theta$ using proportion correct in long tests. In the present
paper we investigate the intersection of these two approaches, using Stout's principle that
any reasonable set of items $\underline{X}_J$ can be embedded in an infinitely long sequence of items
$\underline{X}$ of the same character.

We introduce three concepts which are helpful in the search for such a characterization.
First, we consider only representations which are minimally useful in the sense that (1) the
latent trait $\Theta$ can be consistently estimated from the item responses; (2) $\Theta$ is monotonically
related to the test's “true score”; and (3) $\Theta$ is not constant in the examinee population.
Second, we argue that the condition $\mathrm{Cov}(X_i, X_j \mid \bar{X}_J) \le 0$ is a natural one to add to CA and
$d_E = 1$ to ensure that the items are locally independent with respect to $\Theta$. Third, we show
that the monotonicity of the empirical ICCs $P[X_j = 1 \mid \bar{X}_J - X_j/J]$ is intimately related to
ICC monotonicity: this “manifest monotonicity” must hold if $d_L = 1$ holds; and conversely
it can be used to verify monotonicity of the usual ICCs $P_j(\theta)$ when $d_E = 1$ holds.

We obtain a nearly complete nonparametric characterization of useful $d_L = 1$ representations
in terms of CA, $d_E = 1$, manifest monotonicity, and the above negative covariance
condition. The negative covariance condition may not be strictly necessary for all $d_L = 1$
representations, but we show with a small simulation study that it probably does hold for
$d_L = 1$ models often considered in practice: the Rasch, 2PL and 3PL models.

Key Words: strict unidimensionality, essential unidimensionality, useful models, conditional
association, negative association, empirical item characteristic curves, monotonicity,
simulation.




Contents 


1 Introduction

2 Conditional association and essential independence

2.1 Conditional association

2.2 Essential independence

2.3 Combining CA and $d_E = 1$

3 Two new conditions

3.1 CA, $d_E = 1$, and useful unidimensional models

3.2 CSN: A natural negative covariance condition

3.3 MM: Manifest monotonicity

4 Characterization of $d_L = 1$

5 Discussion

A The “necessity” of LI, M and D

B Limits on generalizing MM

C An illustration of CSN

References



1  Introduction 


Item response theory (IRT) is a modern attempt to model, and statistically analyze,
examinee responses on standardized achievement or aptitude tests. IRT modeling and analysis,
which occur at the level of individual test questions (items), are greatly facilitated by
the assumption of unidimensionality, i.e. that the latent trait “driving” the item responses
is a one-dimensional, typically real-valued, random variable. Birnbaum (1968) and Lord
(1980) provide complete accounts of traditional unidimensional IRT. In this paper we are
concerned with a general characterization of (the distributions of) item response data for
which traditional unidimensional IRT representations exist.

For our purposes, a test is simply a vector of $J$ items, or equivalently $J$ binary (0/1) item
response variables,

$$\underline{X}_J = (X_1, X_2, \ldots, X_J),$$

representing the correctness of responses of a randomly-chosen examinee to the $J$ test items.
Let $\underline{x}_J$ represent an arbitrary fixed outcome of $\underline{X}_J$, a response pattern; an IRT model
makes assumptions on the conditional distribution $P[\underline{X}_J = \underline{x}_J \mid \underline{\Theta} = \underline{\theta}]$ which impose
restrictions on the marginal distribution $P[\underline{X}_J = \underline{x}_J]$ through the integral

$$P[\underline{X}_J = \underline{x}_J] = \int P[\underline{X}_J = \underline{x}_J \mid \underline{\Theta} = \underline{\theta}] \, dF(\underline{\theta}). \tag{1}$$

Here $F(\underline{\theta})$ is the sampling distribution of the latent trait or trait vector $\underline{\Theta} = (\Theta_1, \ldots, \Theta_d)$ in
the examinee population under discussion; thus our point of view is similar to that of Cressie
and Holland (1983).
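The integral in (1) can be made concrete with a small numerical sketch. The code below is only an illustration under stated assumptions that are not part of the paper: a one-dimensional trait, a standard normal $F$ discretized on a grid, hypothetical logistic ICCs with made-up difficulties, and conditional independence of the items given $\theta$. It approximates the manifest probability of every response pattern for a three-item test.

```python
import itertools
import math

def icc(theta, b):
    """Hypothetical logistic ICC: P[X_j = 1 | Theta = theta] for difficulty b."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def normal_grid(n=201, lo=-6.0, hi=6.0):
    """Discretize an assumed standard normal F on an equally spaced grid."""
    step = (hi - lo) / (n - 1)
    pts = [lo + k * step for k in range(n)]
    wts = [math.exp(-0.5 * t * t) for t in pts]
    total = sum(wts)
    return pts, [w / total for w in wts]

def manifest_distribution(difficulties):
    """Approximate P[X_J = x_J] as the F-weighted sum of P[X_J = x_J | theta]."""
    pts, wts = normal_grid()
    dist = {}
    for x in itertools.product((0, 1), repeat=len(difficulties)):
        prob = 0.0
        for t, w in zip(pts, wts):
            cond = 1.0  # conditional probability of pattern x at this theta
            for xj, b in zip(x, difficulties):
                p = icc(t, b)
                cond *= p if xj == 1 else 1.0 - p
            prob += w * cond
        dist[x] = prob
    return dist

# Made-up difficulties for an easy, a medium and a hard item.
manifest = manifest_distribution([-1.0, 0.0, 1.0])
for pattern, prob in sorted(manifest.items()):
    print(pattern, round(prob, 4))
```

Only the resulting table of pattern probabilities corresponds to what response data can reveal; the grid, the ICCs and the prior are latent ingredients chosen here purely for illustration.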

We can estimate the marginal distribution $P[\underline{X}_J = \underline{x}_J]$ by examining the response data
from an actual test administration. We will call this the manifest structure of the test,
since it can be identified to arbitrary accuracy by increasing the manifest data: in this case,
by increasing the number of examinees observed. On the other hand neither the marginal
distribution $F(\underline{\theta})$ nor the conditional distribution $P[\underline{X}_J = \underline{x}_J \mid \underline{\Theta} = \underline{\theta}]$ is directly observable to
us, in the sense that neither quantity is fully identifiable without also increasing the number
of items.¹ These quantities determine the latent structure of the test.

¹It is possible in principle to decide whether the Rasch model holds, without more items, using the special
relationship of this model with log-linear models (cf. Cressie and Holland, 1983; Tjur, 1982). However, this
does not appear to be possible for other models, nor for the general question of unidimensionality prior to
parametric model selection which concerns us here.

The representation in (1) does not itself restrict the distribution of item responses
$P[\underline{X}_J = \underline{x}_J]$ in any way. Standard IRT practice involves the imposition of additional conditions that
make (1) a restrictive, and hence meaningful, representation. It then becomes a meaningful
and important question to ask whether the model so proposed “fits” the observed response
data.

The traditional IRT assumptions are that local independence holds,

$$P[\underline{X}_J = \underline{x}_J \mid \underline{\Theta} = \underline{\theta}] = \prod_{j=1}^{J} P[X_j = x_j \mid \underline{\Theta} = \underline{\theta}], \tag{LI}$$

and that monotonicity holds,

$$P_j(\underline{\theta}) = P[X_j = 1 \mid \underline{\Theta} = \underline{\theta}] \ \text{is coordinatewise nondecreasing in } \underline{\theta}, \ \forall j, \tag{M}$$

in the sense that if $\theta^{(1)}_k \le \theta^{(2)}_k$ for all $k = 1, 2, \ldots, d$ then $P_j(\underline{\theta}^{(1)}) \le P_j(\underline{\theta}^{(2)})$. When $\Theta$ is
one-dimensional, $P_j(\theta)$ will be called an item characteristic curve, ICC.

One additional assumption is needed to make (1) restrictive, namely that the dimensionality
$d$ of $\underline{\Theta}$ is much smaller than the test length $J$ (see for example Holland and Rosenbaum,
1986), that is,

$$d \ll J. \tag{D}$$

(In the development that follows, this is formalized by requiring that $d$ remain fixed as $J$
grows.) The three assumptions LI, M, and D form the foundation of item/test modeling in
traditional IRT. Appendix A gives examples to show that if any of these three assumptions
is completely omitted the resulting “model” will fit any distribution of binary data (hence
making it scientifically meaningless). The least $d$ for which the representation (1) holds and
satisfies LI and M (and smoothness of the IRFs) we will denote $d_L$. We will refer to the
case in which $d_L = 1$ as the strictly unidimensional case.

Various special cases of the strict unidimensionality assumptions have been investigated
to see what properties they imply for the manifest distribution $P[\underline{X}_J = \underline{x}_J]$ through (1).
Holland and Rosenbaum (Holland, 1981; Rosenbaum, 1984; Holland and Rosenbaum, 1986)
have shown that when the general $d_L = 1$ assumptions hold, the items $\underline{X}_J$ must be
conditionally associated; this shows that $d_L = 1$ is a restrictive and hence meaningful set of
conditions. Cressie and Holland (1983) (see also Tjur, 1982) have characterized the Rasch
model in terms of a suitably restricted log-linear model for $P[\underline{X}_J = \underline{x}_J]$. And de Finetti's
Theorem in classical probability theory may be used to characterize an infinite sequence of
items with identical item response functions by the property that the items themselves must
be exchangeable.

Stout (1987; 1990) capitalizes on the good $\theta$-estimation properties of the proportion correct
score $\bar{X}_J = \frac{1}{J}\sum_{j=1}^{J} X_j$, when $J$ is large and $d_L = 1$ holds, to produce a statistical test of
latent-trait unidimensionality. Stout's statistical test is tailored to his essential unidimensionality
condition ($d_E = 1$) which, in contrast to strict unidimensionality, allows there to
be some minor dependencies among items as well as nonmonotonicities of individual ICCs.

Since we will be considering latent variable representations that are somewhat more
general than the traditional $d_L = 1$ representation, it is worthwhile to ask what constitutes
a “useful” unidimensional latent variable representation. In a nonparametric setting we
propose that such a representation should satisfy the following definition. Note that what
we mean by “useful” here relates primarily to connecting an examinee's item responses
with an estimate of or inference about his/her latent trait score. For other purposes, other
definitions might be appropriate.

Definition 1.1 An IRT representation, in which LI may or may not hold, will be called
useful if and only if the following principles are satisfied:

U1 $\Theta$ can be estimated from the observed values of $X_1, X_2, \ldots, X_J$. At minimum
this should mean that there are functions $t_J(x_1, \ldots, x_J)$ that consistently
estimate $\Theta$ in the sense that

$$t_J(X_1, \ldots, X_J) \to \Theta$$

as the test length $J$ grows. Moreover, consistent estimation should still be
possible even though any fixed small group of items $(Y_1, \ldots, Y_{J_0})$ in $\underline{X}$
is dropped from $\underline{X}$.

U2 Examinees with higher $\Theta$ values tend to score higher on the test. A very
weak condition along these lines is simply the requirement that the average
ICC $\bar{P}_J(\theta)$ be increasing in $\theta$ for each $J$; in other words

$$E[\bar{X}_J \mid \Theta = \theta] \ \text{is increasing in } \theta.$$

U3 $\Theta$ is useful for categorizing examinees. In particular $\Theta$ should be able to
take on at the very least two distinct values, each with positive probability.
These  principles  are  implicit  in  traditional  IRT  work,  and  are  easily  justified  on  practical 
grounds: 

First, $\Theta$ has little statistical value as an index of ability, achievement, aptitude, or other
latent trait, if it cannot be estimated; hence U1. There is no hope that $\Theta$ can be estimated
with high precision unless $J \to \infty$ (e.g. the survey by Fienberg, 1986), so U1 represents, in
some sense, a minimal estimation condition. The principle that estimation of $\Theta$ should not
depend strongly on which particular items are used is central to what we mean by “latent
trait.”

Principle U2 reflects the interpretation of $\Theta$ as a quantity of the latent trait, and of
the test as an instrument for measuring that quantity. It will be seen below (Theorem 2.2)
that U2 also makes it easier for the representation to satisfy U1. When specific parametric
models are constructed (recently, e.g., Jannarone, 1986; Sympson, 1987), there seems to be
considerable latitude available in violating U2 and still having a representation which satisfies
U1; but for nonparametric purposes and for the purpose of interpreting $\Theta$, U2 is appropriate.
Note that U2 does not require individual ICCs to be monotone; this requirement is only
made of the test characteristic curve.

Principle U3 simply reflects the practical desire to use the test to diagnose, assess or
otherwise categorize examinees and examinee populations. If there is no variation in $\Theta$ then
there is no sensible way to use the test in this way. In terms of (1), U3 asserts that the
prior $\Theta$ distribution does not concentrate at a single $\theta$ value.

Except for the special cases of the Rasch model and de Finetti's theorem, no other
characterizations of $d_L = 1$ in terms of features of the manifest structure $P[\underline{X}_J = \underline{x}_J]$ seem
to be known. A general characterization is important for several reasons. First, it allows
us to better understand the structure, identifiability and meaning of the representation (1)
under $d_L = 1$. For example, a consequence of our work here is a better understanding of
how “far” each of the CA and $d_E = 1$ approaches is from the general $d_L = 1$ assumptions.
Second, it suggests that fit tests of the general $d_L = 1$ representation are possible, without
resorting to specific parametrizations of the ICCs, prior $\Theta$ distribution, etc. Hence we could
distinguish between a lack of fit due to latent trait multidimensionality and a lack of fit due
to a poor choice of parametrization.

In this paper we review the nonparametric approaches of Holland/Rosenbaum and Stout,
and consider some new conditions which bring us closer to a general characterization of strict
unidimensionality. Throughout this paper we embed the finite test $\underline{X}_J$ in an infinite sequence
of similar items

$$\underline{X} = (X_1, X_2, \ldots).$$

LI and other traditional IRT properties extend in a natural way to the infinite item sequence
$\underline{X}$ by requiring that they hold, in a consistent fashion, in every finite-length test $\underline{X}_J$ taken
from $\underline{X}$.

In addition to the CA and $d_E = 1$ assumptions, we argue that it is natural to require
$\mathrm{Cov}(X_i, X_j \mid \bar{X}_J) \le 0$ to ensure that the items are locally independent with respect to $\Theta$. Also,
we show that the monotonicity of the empirical ICCs $P[X_j = 1 \mid \bar{X}_J - X_j/J]$ is intimately
related to ICC monotonicity: this “manifest monotonicity” must hold if $d_L = 1$ holds; and
conversely it can be used to verify monotonicity of the usual ICCs $P_j(\theta)$ when $d_E = 1$ holds.
(The result that $d_L = 1$ implies manifest monotonicity has been discovered independently
by I. Molenaar (priv. comm.) and is based on Grayson's (1988) monotone likelihood ratio
result for proportion correct scores.)
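As a rough illustration of manifest monotonicity, the sketch below computes the empirical ICCs exactly for a small test. Conditioning on $\bar{X}_J - X_j/J$ is the same as conditioning on the rest score $\sum_{i \ne j} X_i$, which is what the code uses. Everything specific here is an assumption made for the example and not taken from the paper: hypothetical 2PL item parameters, local independence, and a discretized standard normal trait distribution.

```python
import itertools
import math

def icc(theta, a, b):
    """Hypothetical 2PL ICC, nondecreasing in theta whenever a > 0."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def manifest_distribution(items, n=161, lo=-6.0, hi=6.0):
    """P[X_J = x_J] under LI with an assumed discretized N(0, 1) trait."""
    step = (hi - lo) / (n - 1)
    pts = [lo + k * step for k in range(n)]
    wts = [math.exp(-0.5 * t * t) for t in pts]
    total = sum(wts)
    dist = {}
    for x in itertools.product((0, 1), repeat=len(items)):
        prob = 0.0
        for t, w in zip(pts, wts):
            cond = w / total
            for xj, (a, b) in zip(x, items):
                p = icc(t, a, b)
                cond *= p if xj else 1.0 - p
            prob += cond
        dist[x] = prob
    return dist

def empirical_icc(dist, j):
    """P[X_j = 1 | rest score = r] for r = 0, ..., J - 1."""
    n_items = len(next(iter(dist)))
    num = [0.0] * n_items
    den = [0.0] * n_items
    for x, prob in dist.items():
        r = sum(x) - x[j]  # rest score, i.e. J * (Xbar_J - X_j / J)
        den[r] += prob
        num[r] += prob * x[j]
    return [u / v for u, v in zip(num, den)]

# Made-up (discrimination, difficulty) pairs for six items.
items = [(0.8, -1.0), (1.0, -0.5), (1.5, 0.0), (1.2, 0.5), (2.0, 1.0), (0.6, 1.5)]
dist = manifest_distribution(items)
curves = [empirical_icc(dist, j) for j in range(len(items))]
for j, curve in enumerate(curves):
    print(j, [round(v, 3) for v in curve])
```

With real data the conditional probabilities would be replaced by observed proportions within rest-score groups, so sampling noise would have to be taken into account before declaring a violation.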

Now consider representations in which expected values of the form $E[f(\underline{X}_J) \mid \Theta = \theta]$
are continuous, but LI may or may not hold. Within this class of “smooth” latent trait
representations, we can characterize useful $d_L = 1$ as follows (a more formal statement of
the result is given in Section 4).

Characterization of $d_L = 1$. For any infinite sequence of binary items $\underline{X}$ and latent trait
$\Theta$, a useful $d_L = 1$ representation (1) holds, if and only if the following conditions hold: CA,
$d_E = 1$, manifest monotonicity, and

$$\mathrm{Cov}(X_i, X_j \mid \bar{X}_J, \Theta) \le 0, \quad \forall\, i, j. \tag{2}$$

Moreover in many practical settings it appears that the covariance in (2) continues to be
negative if one omits the conditioning on $\Theta$, because $\bar{X}_J$ is “nearly sufficient” for $\Theta$. In
the Rasch model for example, (2) is guaranteed to be negative when $\Theta$ is omitted. A small
simulation study is reported in Appendix C, illustrating similar behavior in other logistic
IRT models. Thus, although (2) spoils a characterization in terms of the manifest structure
$P[\underline{X}_J = \underline{x}_J]$ alone, it appears we are quite close in practical situations.
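The statement about the Rasch model can be checked numerically on a toy example. The sketch below is a hedged illustration, not the simulation study of Appendix C: it assumes a five-item Rasch test with made-up difficulties and a discretized standard normal trait distribution, and computes $\mathrm{Cov}(X_i, X_j \mid \bar{X}_J)$ exactly from the manifest distribution, conditioning on the total score (which carries the same information as $\bar{X}_J$).

```python
import itertools
import math

def rasch_icc(theta, b):
    """Rasch ICC with difficulty b."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def manifest_distribution(difficulties, n=161, lo=-6.0, hi=6.0):
    """P[X_J = x_J] under LI with an assumed discretized N(0, 1) trait."""
    step = (hi - lo) / (n - 1)
    pts = [lo + k * step for k in range(n)]
    wts = [math.exp(-0.5 * t * t) for t in pts]
    total = sum(wts)
    dist = {}
    for x in itertools.product((0, 1), repeat=len(difficulties)):
        prob = 0.0
        for t, w in zip(pts, wts):
            cond = w / total
            for xj, b in zip(x, difficulties):
                p = rasch_icc(t, b)
                cond *= p if xj else 1.0 - p
            prob += cond
        dist[x] = prob
    return dist

def cov_given_score(dist, i, j):
    """Cov(X_i, X_j | total score = s) for s = 0, ..., J."""
    n_items = len(next(iter(dist)))
    covs = []
    for s in range(n_items + 1):
        p_s = e_i = e_j = e_ij = 0.0
        for x, prob in dist.items():
            if sum(x) == s:
                p_s += prob
                e_i += prob * x[i]
                e_j += prob * x[j]
                e_ij += prob * x[i] * x[j]
        covs.append(e_ij / p_s - (e_i / p_s) * (e_j / p_s))
    return covs

difficulties = [-1.5, -0.5, 0.0, 0.5, 1.5]  # made-up values
dist = manifest_distribution(difficulties)
pairs = list(itertools.combinations(range(len(difficulties)), 2))
all_covs = {pair: cov_given_score(dist, *pair) for pair in pairs}
worst = max(max(c) for c in all_covs.values())
smallest = min(min(c) for c in all_covs.values())
print("largest conditional covariance:", worst)
print("smallest conditional covariance:", smallest)
```

At the extreme scores 0 and $J$ every item is determined, so the covariance there is zero; in between it is negative for this Rasch example.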


2  Conditional  association  and  essential  independence 

2.1  Conditional  association 

Holland and Rosenbaum have sought covariance conditions, or equivalently probability
inequalities, in the distribution of $\underline{X}$ which must be satisfied if any $d_L = 1$ model applies.
The starting place for their investigations may be taken to be coordinatewise nondecreasing
functions $f(\underline{y})$ of finite subtests $\underline{Y} = (Y_1, \ldots, Y_{J_0})$ taken from $\underline{X}$. Examples include

• The weighted average $f(\underline{Y}) = \sum_{j=1}^{J_0} a_j Y_j$, $a_j \ge 0$;

• The “all or nothing” score $f(\underline{Y}) = \prod_{j=1}^{J_0} Y_j$;

• The “at least one” score $f(\underline{Y}) = \max\{Y_j : j = 1, \ldots, J_0\}$;

• Item scores $f(\underline{Y}) = Y_j$.

The coordinatewise nondecreasing functions are exactly those scoring methods which assign
more credit as examinees get more answers correct.
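The scoring rules listed above are easy to write down and check by brute force. The following sketch uses function names that are purely illustrative; it verifies the coordinatewise nondecreasing property by flipping each incorrect answer to correct in turn and confirming that the score never drops.

```python
import itertools

def weighted_average(y, weights):
    """Weighted score; nondecreasing when every weight is nonnegative."""
    return sum(a * yj for a, yj in zip(weights, y))

def all_or_nothing(y):
    """Product of the item responses: 1 only if every item is correct."""
    return int(all(y))

def at_least_one(y):
    """Maximum of the item responses: 1 if any item is correct."""
    return max(y)

def item_score(y, j):
    """The response to a single item j."""
    return y[j]

def is_coordinatewise_nondecreasing(f, n_items):
    """Check f over all binary patterns by raising one coordinate at a time."""
    for y in itertools.product((0, 1), repeat=n_items):
        for k in range(n_items):
            if y[k] == 0:
                y_up = y[:k] + (1,) + y[k + 1:]
                if f(y_up) < f(y):
                    return False
    return True

n_items = 4
results = {
    "weighted average": is_coordinatewise_nondecreasing(
        lambda y: weighted_average(y, [0.5, 1.0, 1.5, 2.0]), n_items),
    "all or nothing": is_coordinatewise_nondecreasing(all_or_nothing, n_items),
    "at least one": is_coordinatewise_nondecreasing(at_least_one, n_items),
    "item score": is_coordinatewise_nondecreasing(lambda y: item_score(y, 2), n_items),
}
print(results)
```

A rule that penalizes a correct answer, for instance a weighted score with a negative weight, fails the same check.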

Under LI, any two such scoring methods will be positively correlated at each fixed ability
level $\Theta = \theta$ (Rosenbaum, 1984): for all finite subtests $\underline{Y}$ taken from $\underline{X}$ and all coordinatewise
nondecreasing functions $f(\underline{y})$ and $g(\underline{y})$,

$$\mathrm{Cov}(f(\underline{Y}), g(\underline{Y}) \mid \Theta = \theta) \ge 0, \tag{3}$$

for each possible $\theta$. This condition on the latent structure can be converted
into a condition on the manifest structure:

Theorem 2.1 (Rosenbaum, 1984; Holland and Rosenbaum, 1986). If $\underline{X}$ satisfies $d_L = 1$,
then $\underline{X}$ is conditionally associated (CA): For every pair of disjoint, finite subtests $\underline{Y}$ and
$\underline{Z}$ in $\underline{X}$, every pair of coordinatewise nondecreasing functions $f(\underline{Y})$ and $g(\underline{Y})$, and every
function $h(\underline{Z})$,

$$\mathrm{Cov}(f(\underline{Y}), g(\underline{Y}) \mid h(\underline{Z}) = c) \ge 0 \quad \forall\, c \in \mathrm{range}(h). \tag{CA}$$

Intuitively, a $d_L = 1$ test possesses so much internal coherence (the item responses are
driven monotonically by the single latent variable $\Theta$) that all reasonable subtest scores must
be correlated, in any subpopulation of examinees selected by any criterion $h(\underline{Z})$ relating to
another part of the test.

Our statement of Theorem 2.1 is an easy extension of Holland and Rosenbaum's result
for finite length tests to infinite item sequences. Note also that for finitely many binary (or
indeed discrete) random variables $Z_1, \ldots, Z_m$, conditioning on a scalar-valued function
$h(\underline{Z})$ is equivalent to Holland and Rosenbaum's practice of conditioning on vector-valued
$\underline{h}(\underline{Z})$. Seminal special cases of (3) and CA were developed by Holland (1981).

CA represents a wide variety of probability inequalities which can be tested in the manifest
distribution $P[\underline{X}_J = \underline{x}_J]$. When CA fails the items cannot be treated as having a
$d_L = 1$ latent representation. Applications to studying the internal coherence of a set of
items may be found in Rosenbaum (1984) or Holland and Rosenbaum (1986). Related work
appears in Holland (1981), Rosenbaum (1985), Rosenbaum (1987), and Rosenbaum (1988).
An application of CA to assessing the dimensionality of standardized tests for the National
Assessment of Educational Progress is described by Zwick (1987).
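One of the inequalities summarized by CA can be computed exactly for a toy test. The sketch below is illustrative only and rests on assumptions that are not in the paper: a five-item test with hypothetical 2PL parameters, local independence, and a discretized standard normal trait. It takes $\underline{Y}$ to be the first three items, $\underline{Z}$ the last two, $f$ the number correct on $\underline{Y}$, $g$ the “at least one” score on $\underline{Y}$, and $h$ the number correct on $\underline{Z}$.

```python
import itertools
import math

def icc(theta, a, b):
    """Hypothetical 2PL ICC, nondecreasing in theta whenever a > 0."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def manifest_distribution(items, n=161, lo=-6.0, hi=6.0):
    """P[X_J = x_J] under LI with an assumed discretized N(0, 1) trait."""
    step = (hi - lo) / (n - 1)
    pts = [lo + k * step for k in range(n)]
    wts = [math.exp(-0.5 * t * t) for t in pts]
    total = sum(wts)
    dist = {}
    for x in itertools.product((0, 1), repeat=len(items)):
        prob = 0.0
        for t, w in zip(pts, wts):
            cond = w / total
            for xj, (a, b) in zip(x, items):
                p = icc(t, a, b)
                cond *= p if xj else 1.0 - p
            prob += cond
        dist[x] = prob
    return dist

def conditional_covariances(dist, f, g, h, y_idx, z_idx):
    """Cov(f(Y), g(Y) | h(Z) = c) for every attainable value c."""
    moments = {}
    for x, prob in dist.items():
        y = tuple(x[k] for k in y_idx)
        z = tuple(x[k] for k in z_idx)
        fy, gy = f(y), g(y)
        acc = moments.setdefault(h(z), [0.0, 0.0, 0.0, 0.0])
        acc[0] += prob
        acc[1] += prob * fy
        acc[2] += prob * gy
        acc[3] += prob * fy * gy
    return {c: m[3] / m[0] - (m[1] / m[0]) * (m[2] / m[0])
            for c, m in moments.items()}

# Made-up (discrimination, difficulty) pairs.
items = [(1.0, -1.0), (1.4, -0.3), (0.7, 0.4), (1.8, 0.0), (1.1, 1.0)]
dist = manifest_distribution(items)
ca_covs = conditional_covariances(dist, sum, max, sum, (0, 1, 2), (3, 4))
for c in sorted(ca_covs):
    print("h(Z) =", c, "covariance =", round(ca_covs[c], 5))
```

A single negative value among such covariances in real data (beyond sampling error) would count as evidence against a $d_L = 1$ representation; the full CA condition of course involves every choice of $f$, $g$, $h$ and subtests, not just this one.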

2.2  Essential  independence 

A successful approach to identifying unidimensional latent structure outside the strict $d_L = 1$
framework has been pursued in the seminal work of Stout (1987, 1990), and extended by
Junker (1988, 1991). The main idea, which borrows from both the “large sample theory”
tradition in mathematical statistics and the “factor analysis” tradition in psychometrics, is
that of essential independence².

For any (infinite) sequence of dichotomous items $\underline{X} = (X_1, X_2, X_3, \ldots)$, define bounded
item scores to be functions $A_j(X_j)$ such that for some $M < \infty$, $|A_j(X_j)| \le M$ for all
$j$. We will call a bounded item score an ordered item score if moreover $A_j(0) \le A_j(1)$.
Define a bounded test score to be the average of the first $J$ bounded item scores
$\bar{A}_J = \frac{1}{J}\sum_{j=1}^{J} A_j(X_j)$. Finally, we will say the ordered item scores are asymptotically discriminating
if $\frac{1}{J}\sum_{j=1}^{J} \{A_j(1) - A_j(0)\}$ is positive and bounded away from 0 as $J \to \infty$.

The infinite item sequence $\underline{X}$ is essentially independent (EI) with respect to $\Theta$ if and only
if

$$\lim_{J \to \infty} \mathrm{Var}(\bar{A}_J \mid \Theta = \theta) = 0 \tag{EI}$$

for all bounded test scores $\bar{A}_J$. When EI holds, $\bar{A}_J$ is a consistent estimator of the “true
score” $\bar{A}_J(\theta) = E[\bar{A}_J \mid \Theta = \theta]$, as $J \to \infty$. In particular, for a sequence of dichotomous items
$\underline{X}$, EI implies that the proportion correct score $\bar{X}_J$ consistently estimates values of the test
characteristic function $\bar{P}_J(\theta)$, as $J \to \infty$.
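To see why the conditional variance in (EI) vanishes in the simplest case, note that under local independence the proportion correct score has $\mathrm{Var}(\bar{X}_J \mid \Theta = \theta) = J^{-2}\sum_{j} P_j(\theta)(1 - P_j(\theta)) \le 1/(4J)$. The sketch below evaluates this for growing $J$; the logistic ICCs and the cycling difficulty pattern are assumptions made only for the illustration.

```python
import math

def icc(theta, b):
    """Hypothetical logistic ICC with difficulty b."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def hypothetical_difficulties(n_items):
    """Made-up difficulties cycling through ten values between -2 and 2."""
    return [-2.0 + 4.0 * (j % 10) / 9.0 for j in range(n_items)]

def conditional_variance(theta, difficulties):
    """Var(Xbar_J | Theta = theta) when the items are locally independent."""
    n_items = len(difficulties)
    total = 0.0
    for b in difficulties:
        p = icc(theta, b)
        total += p * (1.0 - p)
    return total / n_items ** 2

lengths = [10, 100, 1000, 10000]
variances = [conditional_variance(0.5, hypothetical_difficulties(J)) for J in lengths]
for J, v in zip(lengths, variances):
    print(J, v)
```

EI asks only for this limiting behavior, so it also admits item sequences with mild local dependence whose conditional covariances average out as $J$ grows.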

The item sequence $\underline{X}$ is essentially unidimensional, for which we shall write $d_E = 1$, if
and only if (a) $\underline{X}$ is EI with respect to a unidimensional $\Theta$; and (b) the items are locally
asymptotically discriminating, LAD: for every set of ordered, asymptotically discriminating
item scores, the “true score” $\bar{A}_J(\theta)$ is nondecreasing in $\theta$, in the strong sense that to every
$\theta$ there corresponds an interval $N_\theta$ containing $\theta$ and an $\epsilon_\theta > 0$ such that

$$\frac{\bar{A}_J(t) - \bar{A}_J(\theta)}{t - \theta} \ge \epsilon_\theta, \quad \forall\, t \in N_\theta,\ t \ne \theta,\ \forall\, J. \tag{LAD}$$

If no such unidimensional $\Theta$ exists, we write $d_E > 1$.

It is not apparent from the above discussion, but Stout (1987) and Junker (1988, Section
3.2) make it clear that $d_E = 1$ can be checked from the marginal distribution of $\underline{X}$. A
statistical procedure for testing the hypothesis that a set of $J$ items $\underline{X}_J$ comes from a $d_E = 1$
item sequence has been developed by Stout (1987) and refined by Stout and Nandakumar
(Nandakumar, 1987, 1990; Nandakumar and Stout, 1990).

When $d_E = 1$, $\bar{A}_J(\theta)$ may be inverted to produce estimates of $\theta$ directly:

Theorem 2.2 (Stout, 1990). If the item sequence $\underline{X}$ satisfies $d_E = 1$ with respect to $\Theta$ then
for any set of asymptotically discriminating item scores,

$$\forall\, \epsilon > 0, \quad \lim_{J \to \infty} P\left[\, |\bar{A}_J^{-1}(\bar{A}_J) - \theta| < \epsilon \,\middle|\, \Theta = \theta \right] = 1, \tag{4}$$

where $\bar{A}_J^{-1}(u)$ is the inverse function for the “true score” $\bar{A}_J(\theta)$.

²Actually, we use Stout's strong essential independence, with some minor changes in terminology to match
Junker (1991). Any of the three variations of EI could be used; cf. Corollary 2.1 in Section 2.3.


Indeed, under the conditions of Theorem 2.2 and some mild smoothness conditions, the
maximum likelihood estimate of $\theta$ calculated as though LI were true is also consistent for
$\theta$ (Junker, 1991). Moreover the latent trait with respect to which $d_E = 1$ holds is unique,
up to a monotone transformation (Stout, 1990). It is valuable to think of EI as the greatest
possible weakening of LI under which LI-based trait estimation/prediction schemes could
be expected to work. In this sense, the study of EI is the study of robustness of ability
estimators to variations from an LI latent structure. Clarke and Junker (in progress) pursue
this matter in a more general setting.
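The inversion idea behind Theorem 2.2 can be sketched in a few lines. The simulation below is only an illustration under assumptions that are not part of the paper: locally independent items, hypothetical logistic ICCs with made-up difficulties, item scores $A_j(X_j) = X_j$, and a known test characteristic curve that is inverted by bisection. It estimates $\theta$ from the proportion correct for tests of increasing length.

```python
import math
import random

def icc(theta, b):
    """Hypothetical logistic ICC with difficulty b."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def true_score(theta, difficulties):
    """Test characteristic curve: expected proportion correct at theta."""
    return sum(icc(theta, b) for b in difficulties) / len(difficulties)

def invert_true_score(u, difficulties, lo=-8.0, hi=8.0, iters=60):
    """Bisection inverse of the (increasing) test characteristic curve."""
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if true_score(mid, difficulties) < u:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def estimate_theta(theta, n_items, rng):
    """Simulate one examinee at theta and invert the proportion correct."""
    difficulties = [-2.0 + 4.0 * (j % 10) / 9.0 for j in range(n_items)]
    responses = [1 if rng.random() < icc(theta, b) else 0 for b in difficulties]
    proportion_correct = sum(responses) / n_items
    return invert_true_score(proportion_correct, difficulties)

rng = random.Random(0)
theta_true = 0.5
estimates = {J: estimate_theta(theta_true, J, rng) for J in (50, 500, 5000)}
for J, est in estimates.items():
    print(J, round(est, 3))
```

In practice the test characteristic curve is not known and would itself have to be estimated, so this shows only the consistency mechanism, not a usable estimation procedure.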


2.3 Combining CA and $d_E = 1$

The following lemma tells us that under $d_E = 1$, for any finite subtest $\underline{Y}$ in $\underline{X}$, we
may approximate expected values of the form $E[f(\underline{Y}) \mid \Theta]$ with expected values of the form
$E[f(\underline{Y}) \mid \alpha_J \le \bar{X}_J \le \beta_J]$, as $J \to \infty$. We assume for the remainder of the paper that

$$P[C \mid \Theta = \theta] \ \text{is continuous in } \theta \tag{5}$$

whenever $C$ is an event depending on only finitely many $X_j$'s. Condition (5) implies that
$E[f(\underline{Y}) \mid \Theta = t]$ is continuous in $t$ for any function $f(\underline{Y})$ of finitely many items.


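Lemma 2.1 Suppose $\underline{X}$ satisfies EI and LAD with respect to some unidimensional $\Theta$, and
assume (5). If $f(\underline{Y})$ is a function which depends on only finitely many (fixed) items
$\underline{Y} = (Y_1, \ldots, Y_{J_0})$ from $\underline{X}$, then for every set of bounded asymptotically discriminating item scores
$A_j(X_j)$ and for each $\theta$ there exist $\epsilon_J \to 0$ for which

$$\lim_{J \to \infty} E\left[ f(\underline{Y}) \,\middle|\, |\bar{A}_J^{-1}(\bar{A}_J) - \theta| < \epsilon_J \right] = E[f(\underline{Y}) \mid \Theta = \theta].$$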


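Proof. We give the proof in the case that $\Theta$ is continuous, but a similar argument may be
given for discrete numerical $\Theta$.

For any event $C$, let $I_C$ take the value 1 if $C$ is true and 0 if $C$ is false, and let
$E[f(\underline{Y}); C] = E[f(\underline{Y}) I_C]$. We may decompose the expectation on the left above as

$$E\left[ f(\underline{Y}) \,\middle|\, |\bar{A}_J^{-1}(\bar{A}_J) - \theta| < \epsilon \right]
= \frac{E[f(\underline{Y});\ |\Theta - \theta| < \epsilon]}{P[|\Theta - \theta| < \epsilon]}
\cdot \frac{E[f(\underline{Y});\ |\bar{A}_J^{-1}(\bar{A}_J) - \theta| < \epsilon]}{E[f(\underline{Y});\ |\Theta - \theta| < \epsilon]}
\cdot \frac{P[|\Theta - \theta| < \epsilon]}{P[|\bar{A}_J^{-1}(\bar{A}_J) - \theta| < \epsilon]}
= \mathrm{I}(\epsilon) \cdot \mathrm{II}(\epsilon) \cdot \mathrm{III}(\epsilon).$$

Note that for any rate $\epsilon = \epsilon_J \to 0$, $\mathrm{I}(\epsilon_J) \to E[f(\underline{Y}) \mid \Theta = \theta]$ as $J \to \infty$, using the continuity
condition (5) and the integral mean value theorem. The idea now is to choose $\epsilon = \epsilon_J \to 0$ so
that $\mathrm{II} \to 1$ and $\mathrm{III} \to 1$ as $J \to \infty$. We will look at II explicitly; note that III is a special
case of II. We have

$$\mathrm{II}(\epsilon) - 1 = \frac{E\left[ f(\underline{Y}) \left( I_{\{|\bar{A}_J^{-1}(\bar{A}_J) - \theta| < \epsilon\}} - I_{\{|\Theta - \theta| < \epsilon\}} \right) \right]}{E\left[ f(\underline{Y}) \, I_{\{|\Theta - \theta| < \epsilon\}} \right]};$$

one can apply Theorem 2.2 to show that the numerator on the right tends to zero for each
fixed $\epsilon > 0$ as $J \to \infty$; a simple diagonalization argument now yields a rate $\epsilon_J \to 0$ for which
$\mathrm{II} \to 1$. A similar argument works for III, and a further diagonalization completes the proof.
□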
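The approximation in Lemma 2.1 can be examined numerically in a simple special case. The sketch below is an illustration under assumptions not made in the paper: a Rasch-type test with made-up difficulties, item scores $A_j(X_j) = X_j$, a discretized standard normal trait, a single target item $Y$ with $f(Y) = Y$, and the hypothetical rate $\epsilon_J = J^{-1/4}$. The event $|\bar{A}_J^{-1}(\bar{A}_J) - \theta| < \epsilon_J$ is represented by the total score falling between $J\bar{P}_J(\theta - \epsilon_J)$ and $J\bar{P}_J(\theta + \epsilon_J)$, and the conditional mean of $Y$ given that window is computed exactly.

```python
import math

def icc(theta, b):
    """Hypothetical logistic ICC with difficulty b."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def score_distribution(probs):
    """Distribution of a sum of independent Bernoulli variables."""
    dist = [1.0]
    for p in probs:
        new = [0.0] * (len(dist) + 1)
        for s, mass in enumerate(dist):
            new[s] += mass * (1.0 - p)
            new[s + 1] += mass * p
        dist = new
    return dist

def windowed_mean(theta, eps, b_target, b_rest, n=81, lo=-5.0, hi=5.0):
    """E[Y | total score in the window implied by theta - eps and theta + eps]."""
    b_all = [b_target] + b_rest

    def expected_score(t):
        return sum(icc(t, b) for b in b_all)

    s_lo, s_hi = expected_score(theta - eps), expected_score(theta + eps)
    step = (hi - lo) / (n - 1)
    num = den = 0.0
    for k in range(n):
        t = lo + k * step
        w = math.exp(-0.5 * t * t)  # unnormalized N(0, 1) weight (assumed prior)
        rest = score_distribution([icc(t, b) for b in b_rest])
        p = icc(t, b_target)
        in_if_right = sum(m for r, m in enumerate(rest) if s_lo <= r + 1 <= s_hi)
        in_if_wrong = sum(m for r, m in enumerate(rest) if s_lo <= r <= s_hi)
        num += w * p * in_if_right
        den += w * (p * in_if_right + (1.0 - p) * in_if_wrong)
    return num / den

theta, b_target = 0.5, 0.0
latent_value = icc(theta, b_target)  # E[Y | Theta = theta]
approximations = {}
for J in (50, 100, 200):
    b_rest = [-2.0 + 4.0 * (j % 10) / 9.0 for j in range(J - 1)]
    approximations[J] = windowed_mean(theta, J ** -0.25, b_target, b_rest)
    print(J, round(approximations[J], 4), "target", round(latent_value, 4))
```

The window here is defined through the known test characteristic curve; with data alone, the window endpoints would be chosen on the proportion correct scale, as in the quantities $E[f(\underline{Y}) \mid \alpha_J \le \bar{X}_J \le \beta_J]$.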

We can use Lemma 2.1 to gain information about the latent structure of an item sequence
$\underline{X}$ from the manifest condition CA. Proposition 2.1 shows that CA and $d_E = 1$ together give
the same local association condition (3) as $d_L = 1$ alone.

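Proposition 2.1 Suppose the item sequence $\underline{X}$ satisfies CA and $d_E = 1$, and suppose that
(5) holds. Then (3) holds:

$$\mathrm{Cov}(f(\underline{Y}), g(\underline{Y}) \mid \Theta = \theta) \ge 0$$

for all $\theta$, all coordinatewise nondecreasing $f$ and $g$, and all finite tests $\underline{Y}$ taken from $\underline{X}$.

Remarks. By modifying the proof of Lemma 2.1 we could also conclude conditional association
given $\Theta = \theta$, i.e., if $\underline{Z}$ were a finite test from $\underline{X}$ disjoint from $\underline{Y}$ and $h(\underline{Z})$ were any
function, then

$$\mathrm{Cov}(f(\underline{Y}), g(\underline{Y}) \mid h(\underline{Z}), \Theta = \theta) \ge 0.$$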




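Proof. Let $\underline{Y} \subset \underline{X}_{J_0}$ be an arbitrary finite test, for fixed $J_0$, let $\underline{W} = (X_{J_0+1}, X_{J_0+2}, \ldots)$
and, for this proof, let $\bar{P}_J(\theta) = E[\bar{W}_J \mid \Theta = \theta]$, where $\bar{W}_J$ is the proportion correct on the
first $J$ items of $\underline{W}$. Using CA and a sequence $\epsilon_J$ obtained from Lemma 2.1,

$$0 \le \mathrm{Cov}\left( f(\underline{Y}), g(\underline{Y}) \,\middle|\, |\bar{P}_J^{-1}(\bar{W}_J) - \theta| < \epsilon_J \right) \to \mathrm{Cov}(f(\underline{Y}), g(\underline{Y}) \mid \Theta = \theta)$$

as $J \to \infty$. □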

Let us digress briefly to indicate another way in which CA and $d_E = 1$ interact well.
Two alternative definitions of EI have been proposed by Stout (1990), one involving the full
sequence $\underline{X}$ but taking absolute values of covariances,

$$\lim_{J \to \infty} \binom{J}{2}^{-1} \sum_{1 \le i < j \le J} |\mathrm{Cov}(X_i, X_j \mid \Theta = \theta)| = 0, \tag{6}$$

and another involving “nonsparse subtests” which, in the present context, is equivalent to
considering only those asymptotically discriminating item scores for which $A_j(0) = 0$ and
$A_j(1) \in \{0, 1\}$, and requiring

$$\lim_{J \to \infty} \binom{J}{2}^{-1} \sum_{1 \le i < j \le J} \mathrm{Cov}(A_i(X_i), A_j(X_j) \mid \Theta = \theta) = 0. \tag{7}$$

It  is  not  known  in  general  whether  these  three  definitions  are  equivalent.  However,  under 
CA  they  are: 

Corollary 2.1 If CA and LAD hold for $\underline{X}$, then all three definitions of EI are equivalent.

Proof. Condition (6) implies EI as defined in Section 2.2, which in turn implies (7), since
each condition is a special case of the preceding one. For the converse directions, observe
that if LAD holds (for the restricted case of $A_j(X_j) \in \{0, 1\}$ $\forall j$), then by Proposition 2.1,
$\mathrm{Cov}(X_i, X_j \mid \theta) \ge 0$, $\forall\, i, j$. In this case, (6) becomes a special case of (7), and we are done. □

Returning to our main development, the next proposition complements Proposition 2.1 by
characterizing LI in terms of quantities that could in principle be approximated by manifest
quantities $E[f(\underline{Y}) \mid \alpha_J \le \bar{X}_J \le \beta_J]$ as in Lemma 2.1 under EI and LAD.

Proposition  2.2  The  item  sequence  X_  satisfies  LI  with  respect  to  0  if  and  only  if  the 
following  two  conditions  hold: 


13 


For  all  d,  all  nondecreasing  f  and  g,  and  all  finite  tests  Y_  taken  from  X , 


Cov(/(y), 5(11)10  =  0)>O; 


(S) 


For  all  6,  i,  and  j, 


Cow{Xi,x,\e  =  e)  <0. 


(9) 


Remarks. Note that (8) says that $\underline{X}$ is associated (as defined by Esary, Proschan and
Walkup, 1967), given $\Theta = \theta$, for all $\theta$; indeed (8) is exactly the same as (3).

proof. That LI implies (8) follows from Esary, Proschan and Walkup (1967); (9) is trivially
satisfied under LI with $\mathrm{Cov}(X_i, X_j \mid \Theta) = 0$. For a proof of the converse, in unconditional
form, see Newman and Wright (1981) or Joag-Dev (1983). □


3  Two  new  conditions 

Propositions 2.1 and 2.2 suggest that if we require both CA and $d_E = 1$ in the item sequence
$\underline{X}$, we are not far away from a strictly unidimensional representation. Indeed, by
Proposition 2.1, we know that CA and $d_E = 1$ will guarantee (8). In this section we will explore
conditions on $P[\underline{X}_J = \underline{x}_J]$ that will guarantee (9) also. We also show that it is possible to
check for monotone ICC’s (a condition not explicitly provided for by CA and $d_E = 1$) by
examining the “empirical ICC’s” $P[X_j = 1 \mid \bar{X}_J - X_j/J]$.

3.1  CA,  dE  =  1,  and  useful  unidimensional  models 

It can be seen from Theorem 2.1 and the discussion following the definition of essential
independence that CA and EI are both necessary conditions for the sequence of items $\underline{X}$ to
have a $d_L = 1$ representation. In this section, we consider several “thought examples” which
suggest to what extent these two conditions suffice to characterize $d_L = 1$. We restrict
our attention to useful $d_L = 1$ models, as defined in Section 1. Because LAD formalizes
principle U2 and Theorem 2.2 satisfies principle U1, any $d_L = 1$ model in which $\Theta$ varies
in the examinee population is useful in the sense of Definition 1.1.




Example 3.1 CA is not a guarantee that a useful $d_L = 1$ model exists. Suppose $(X_1, X_2, \ldots)$
are independent (unconditionally). Then CA follows from a result of Esary, Proschan and
Walkup (1967). Now, any $\Theta$ that we can estimate using principle U1 will, according to
the 0-1 Law of probability theory (e.g. Ash, 1972, p. 278), fail to vary from examinee to
examinee. This violates principle U3; hence no useful latent trait model for this CA
sequence exists. (Of course this is no surprise, since a fully independent sequence of response
variables, e.g. coin flips, should intuitively not be able to tell us anything useful about a
latent trait anyway!) □

Example 3.2 $d_E = 1$ is not a guarantee that a useful $d_L = 1$ model exists. Stout (1990,
Example 2.3) gives a model for a sequence of “paragraph comprehension” questions which
is a useful $d_E = 1$ model. We shall show that no useful $d_L = 1$ model can be formulated
for this sequence of paragraph comprehension items. Indeed, if items $X_i$ and $X_j$ refer to
the same reading passage we expect $\mathrm{Cov}(X_i, X_j \mid \Theta = \theta_0) \ne 0$ for some $\theta_0$. Now suppose, by
way of contradiction, that there exists a unidimensional latent trait $\tau$ with respect to which
LI and LAD hold; in particular $\mathrm{Cov}(X_i, X_j \mid \tau) = 0$. Now by Stout’s unique trait theorem
(Theorem 3.3 of Stout, 1990), $\tau$ would be a monotone (indeed invertible) transformation
of $\Theta$, $\tau = g(\Theta)$. But, taking $t_0 = g(\theta_0)$,

$$0 = \mathrm{Cov}(X_i, X_j \mid \tau = t_0) = \mathrm{Cov}(X_i, X_j \mid g(\Theta) = g(\theta_0)) = \mathrm{Cov}(X_i, X_j \mid \Theta = \theta_0) \ne 0.$$

This contradiction shows that no such $\tau$ can exist, i.e. no useful $d_L = 1$ model exists for the
sequence of paragraph comprehension items. □

Example 3.3 CA and $d_E = 1$ together may not guarantee that a useful $d_L = 1$ model
exists. Let $\underline{X}$ be an item sequence satisfying $d_E = 1$ with respect to some unidimensional
$\Theta$. We may imagine the $\Theta$ in this $d_E = 1$ representation as being the first coordinate of the
latent trait vector $\boldsymbol{\Theta} = (\Theta, \Theta_2, \Theta_3, \ldots, \Theta_d)$ needed for a $d_L = d$ representation:

$$P[\underline{X}_J = \underline{x}_J \mid \Theta = \theta] = \int P[\underline{X}_J = \underline{x}_J \mid \boldsymbol{\Theta} = \boldsymbol{\theta}]\, f(\boldsymbol{\theta} \mid \Theta = \theta)\, d\boldsymbol{\theta}$$

for each $J$, where, by LI with respect to $\boldsymbol{\Theta} = (\Theta, \Theta_2, \Theta_3, \ldots, \Theta_d)$,

$$P[\underline{X}_J = \underline{x}_J \mid \boldsymbol{\Theta} = \boldsymbol{\theta}] = \prod_{j=1}^{J} P_j(\boldsymbol{\theta})^{x_j} [1 - P_j(\boldsymbol{\theta})]^{1 - x_j}.$$

In general, although $d_E = 1$ means that

$$\lim_{J \to \infty} \binom{J}{2}^{-1} \sum_{1 \le i < j \le J} \mathrm{Cov}(Z_i, Z_j \mid \Theta = \theta) = 0$$

for every nonsparse subsequence $\underline{Z}$ from $\underline{X}$, the individual covariances

$$\mathrm{Cov}(Z_i, Z_j \mid \Theta = \theta) = \mathrm{Cov}(P_i(\boldsymbol{\Theta}), P_j(\boldsymbol{\Theta}) \mid \Theta = \theta)$$

may be positive or negative, depending on how the traits $\Theta_2, \Theta_3, \ldots, \Theta_d$ interact with $\Theta$.
Now suppose $\underline{X}$ satisfies CA also. Then by Proposition 2.1,

$$\mathrm{Cov}(X_i, X_j \mid \Theta = \theta) \ge 0, \quad \forall i, j; \qquad (10)$$

in fact the stronger condition (3) holds. Thus, not only is $\Theta$ the dominant trait for $\underline{X}$,
but the “minor traits” needed for LI to hold are concordant with $\Theta$, in the sense that they
interact with $\Theta$ so as to keep the local inter-item covariances nonnegative.

Under CA and $d_E = 1$, therefore, there is enough coherence among the items that
covariances between items, given $\Theta = \theta$, are nonnegative. Indeed it is quite plausible that
under these conditions, for some tests, some of the inequalities in (10) will be strict (despite
its plausibility we have not been able to construct an example in which this may rigorously
be shown). But if any of the inequalities (10) are strict for a dominant latent trait $\Theta$ with
respect to which $d_E = 1$, then there cannot exist any other unidimensional trait $\tau$ with
respect to which a useful $d_L = 1$ model exists; this follows by the same argument as in
Example 3.2. □


3.2  CSN:  A  natural  negative  covariance  condition 

Thus some condition in addition to CA and $d_E = 1$ seems to be needed to get a useful $d_L = 1$
model. When LI holds, we know that (3) holds also: $\mathrm{Cov}(f(\underline{Y}), g(\underline{Y}) \mid \Theta = \theta) \ge 0$ for all
finite subtests $\underline{Y}$ and nondecreasing functions $f$ and $g$. As indicated in Example 3.3, there
may be some situations in which EI and CA hold but the implied coherence among items is
so tight that LI cannot also hold. What is needed is a condition that loosens this coherence
so that, despite (3), individual item pairs have zero covariance.

An interesting condition derived by Joag-Dev and Proschan (1982) implies that when LI
with respect to $\Theta$ does hold, then

$$\mathrm{Cov}(X_i, X_j \mid \bar{X}_J, \Theta) \le 0 \qquad \text{(LCSN)}$$

for all $i < j \le J$. This says that, under LI, the items are “not too tightly bound together”
even though (2) holds; each $X_i$ and $X_j$ are sufficiently free of one another among examinees
of the same ability that when $X_i$ increases from one examinee to the next, $X_j$ is free to
decrease so that the test score $\bar{X}_J$ may be kept constant. The abbreviation LCSN stands
for locally, covariances given test scores are negative.

However, LCSN is a condition on the latent, not the manifest, structure. To obtain a
natural manifest structure analogue to LCSN it is useful to consider the special case of the
locally independent Rasch model. Here $\bar{X}_J$ is sufficient for $\Theta$, i.e. $(X_1, \ldots, X_J)$ are
independent of $\Theta$ given $\bar{X}_J$. One consequence of this is that

$$\mathrm{Cov}(X_i, X_j \mid \bar{X}_J) = \mathrm{Cov}(X_i, X_j \mid \bar{X}_J, \Theta), \qquad (11)$$

in which case LCSN is equivalent to the manifest condition

$$\mathrm{Cov}(X_i, X_j \mid \bar{X}_J) \le 0 \qquad \text{(CSN)}$$

for all $i < j \le J$. Here CSN should be read as covariances given test scores are negative. In
practice it may be necessary to allow (CSN) to be violated for very small values of $J$, e.g.
$J < 10$.

Because $\bar{X}_J$ is not sufficient for $\Theta$ outside the Rasch model, CSN is an imperfect substitute
for LCSN. It is valuable to know how closely related LCSN and CSN are under LI. Suppose
LI and hence LCSN holds. To examine CSN one might consider the decomposition

$$\mathrm{Cov}(X_i, X_j \mid \bar{X}_J) = E[\mathrm{Cov}(X_i, X_j \mid \bar{X}_J, \Theta) \mid \bar{X}_J] + \mathrm{Cov}\big[E[X_i \mid \bar{X}_J, \Theta], E[X_j \mid \bar{X}_J, \Theta] \,\big|\, \bar{X}_J\big].$$

The first term on the right is nonpositive, by LCSN. The second term may be negative or
positive, but should be small since under $d_L = 1$ the (posterior) distribution of $\Theta$ given $\bar{X}_J$
should have very low variance, as $J$ grows. Some preliminary work of B. Clarke and J. K.
Ghosh (Clarke, priv. comm.) points toward a proof of this assertion. This suggests that in
most $d_L = 1$ models

$$\mathrm{Cov}(X_i, X_j \mid \bar{X}_J) \approx E[\mathrm{Cov}(X_i, X_j \mid \bar{X}_J, \Theta) \mid \bar{X}_J] \qquad (12)$$

for longer tests, even though $\bar{X}_J$ is not sufficient for $\Theta$. In particular when $\mathrm{Cov}(X_i, X_j \mid \bar{X}_J)$
fails to be negative for a LI model, we at least expect it to be near zero. Thus one might
consider, not CSN, but rather a condition like

$$\mathrm{Cov}(X_i, X_j \mid \bar{X}_J) \le 0 + \epsilon_J \qquad (13)$$

for suitably chosen $\epsilon_J > 0$; this is similar to including an indifference region in a test of
the null hypothesis CSN against a general alternative. The plausibility of CSN and (13) in
locally independent two parameter and three parameter logistic IRT models is illustrated in
a small simulation in Appendix C.
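The claim that CSN holds exactly in the Rasch model can be checked by brute force on a short test. The following is a minimal sketch, not the simulation of Appendix C: the item difficulties, the three-point ability distribution and all function names are hypothetical choices made for illustration, and conditioning on the raw total score is equivalent to conditioning on $\bar{X}_J$.

```python
from itertools import product
from math import exp

def rasch_icc(theta, b):
    """P[X_j = 1 | theta] for a Rasch item with difficulty b."""
    return 1.0 / (1.0 + exp(-(theta - b)))

def pattern_probs(difficulties, thetas, weights):
    """Marginal probability of every response pattern under LI."""
    probs = {}
    for x in product((0, 1), repeat=len(difficulties)):
        total = 0.0
        for theta, w in zip(thetas, weights):
            like = 1.0
            for xj, b in zip(x, difficulties):
                p = rasch_icc(theta, b)
                like *= p if xj else 1.0 - p
            total += w * like
        probs[x] = total
    return probs

def cov_given_score(probs, i, j, score):
    """Cov(X_i, X_j | total score = score), computed exactly from the pattern table."""
    sel = {x: p for x, p in probs.items() if sum(x) == score}
    mass = sum(sel.values())
    ei = sum(p * x[i] for x, p in sel.items()) / mass
    ej = sum(p * x[j] for x, p in sel.items()) / mass
    eij = sum(p * x[i] * x[j] for x, p in sel.items()) / mass
    return eij - ei * ej

difficulties = [-1.0, -0.5, 0.0, 0.5, 1.0]   # hypothetical item difficulties
thetas = [-1.5, 0.0, 1.5]                    # hypothetical ability support
weights = [0.3, 0.4, 0.3]                    # hypothetical ability distribution

probs = pattern_probs(difficulties, thetas, weights)
J = len(difficulties)
# Largest conditional covariance over all score levels and item pairs.
worst = max(cov_given_score(probs, i, j, s)
            for s in range(J + 1)
            for i in range(J) for j in range(i + 1, J))
print(worst)
```

Under these assumptions `worst` should be no larger than zero up to floating-point error, since the covariance vanishes at the extreme scores and is negative in between.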


3.3  MM:  Manifest  monotonicity 

When one desires to check M in practical situations, a condition like the following is often
used. Let $\bar{X}_{iJ} = \bar{X}_J - X_i/J$; we will say manifest monotonicity, MM, holds if

$$E[X_i \mid \bar{X}_{iJ}] \text{ is nondecreasing in } \bar{X}_{iJ} \qquad \text{(MM)}$$

for all $i \le J$ (and all $J$). This intuitively appealing monotonicity check is intimately related
to $d_L = 1$ latent structure, in that

(a) LI, M $\Rightarrow$ MM;

(b) EI, LAD, MM $\Rightarrow$ M; and hence

(c) under LI and LAD, MM $\Leftrightarrow$ M.

Assertions (a) and (b) are proved in Proposition 3.1, and (c) is an immediate corollary.
Molenaar (priv. comm.) has independently discovered (a), and we report examples of Molenaar
and Snijder in Appendix B that indicate the limitations of the method of proof for (a).




Lemma 3.1 If LI and M hold for an item sequence $\underline{X}$ and latent trait $\Theta$, then $\Theta$ is
stochastically increasing in $S = \bar{X}_{iJ}$:

$$\forall\, a < b,\ \forall\, c: \quad P[\Theta > c \mid S = a] \le P[\Theta > c \mid S = b] \qquad (14)$$

whenever the conditional probabilities are defined.

proof. We may apply Grayson (1988), Theorem 2, to the subtest $(X_1, \ldots, X_{i-1}, X_{i+1}, \ldots, X_J)$
to see that the score $S = \bar{X}_{iJ}$ has the monotone likelihood ratio property

$$R_{ab}(\theta) = \frac{P[S = b \mid \theta]}{P[S = a \mid \theta]} \text{ nondecreasing in } \theta,\ \forall\, a < b, \qquad (15)$$

whenever the conditional probabilities are defined. To establish (14) (for any score $S$
satisfying (15)), we may write its left hand side as

$$P[\Theta > c \mid S = a] = \frac{\int_c^{+\infty} P[S = a \mid \theta]\, dF(\theta)}{\int_{-\infty}^{+\infty} P[S = a \mid \theta]\, dF(\theta)} = P[T > c]$$

where $T$ is a random variable with density proportional to $P[S = a \mid t] \cdot dF(t)$. (I.e., $T =
[\Theta \mid S = a]$.) On the other hand, the right hand side of (14) may be written as

$$P[\Theta > c \mid S = b] = \frac{\int_c^{+\infty} P[S = b \mid \theta]\, dF(\theta)}{\int_{-\infty}^{+\infty} P[S = b \mid \theta]\, dF(\theta)} = \frac{\int_c^{+\infty} P[S = a \mid \theta] R_{ab}(\theta)\, dF(\theta)}{\int_{-\infty}^{+\infty} P[S = a \mid \theta] R_{ab}(\theta)\, dF(\theta)} = \frac{E[R_{ab}(T) 1_{\{T > c\}}]}{E[R_{ab}(T)]}$$

for the same random variable $T$ (where $1_C$ is the function that takes the value 1 when $C$ is
true and 0 otherwise). Hence (14) is equivalent to the assertion that

$$P[T > c] \cdot E[R_{ab}(T)] \le E[R_{ab}(T) 1_{\{T > c\}}]$$

which follows from property (P3) of Esary, Proschan and Walkup (1967), since $g(T) = 1_{\{T > c\}}$
and $h(T) = R_{ab}(T)$ are both nondecreasing functions of $T$. □
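The stochastic ordering in (14) can be verified numerically for a small locally independent test with monotone ICCs. The sketch below is illustrative only: it assumes two-parameter logistic items with made-up parameters and a five-point ability distribution, and it uses the number-correct score on the remaining items, which orders examinees the same way as $\bar{X}_{iJ}$.

```python
from math import exp

def icc(theta, a, b):
    """Monotone logistic ICC with discrimination a and difficulty b."""
    return 1.0 / (1.0 + exp(-a * (theta - b)))

def score_dist(theta, items):
    """P[S = s | theta] for S = number correct among `items`, assuming LI."""
    dist = [1.0]
    for a, b in items:
        p = icc(theta, a, b)
        new = [0.0] * (len(dist) + 1)
        for s, mass in enumerate(dist):
            new[s] += mass * (1.0 - p)
            new[s + 1] += mass * p
        dist = new
    return dist

def posterior_tail(items, thetas, weights, c):
    """Return [P[Theta > c | S = s] for s = 0..len(items)]."""
    dists = [score_dist(t, items) for t in thetas]
    tails = []
    for s in range(len(items) + 1):
        joint = [w * d[s] for w, d in zip(weights, dists)]
        tails.append(sum(m for t, m in zip(thetas, joint) if t > c) / sum(joint))
    return tails

# Hypothetical (a_j, b_j) for the items that remain after deleting item i.
items = [(1.0, -1.0), (0.7, 0.0), (1.5, 0.5), (1.2, 1.0)]
thetas = [-2.0, -1.0, 0.0, 1.0, 2.0]     # hypothetical ability support
weights = [0.1, 0.2, 0.4, 0.2, 0.1]      # hypothetical ability distribution

# (14): for every cut point c the posterior tail should not decrease with the score.
ordered = all(
    t0 <= t1 + 1e-12
    for c in (-1.5, -0.5, 0.5, 1.5)
    for t0, t1 in zip(posterior_tail(items, thetas, weights, c),
                      posterior_tail(items, thetas, weights, c)[1:])
)
print(ordered)
```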




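Proposition 3.1

(a) LI, M $\Rightarrow$ MM

(b) EI, LAD, MM and (5) $\Rightarrow$ M

proof. (a) Note that

$$E[X_i \mid \bar{X}_{iJ}] = E\big[E[X_i \mid \bar{X}_{iJ}, \Theta] \,\big|\, \bar{X}_{iJ}\big] = E[P_i(\Theta) \mid \bar{X}_{iJ}] \qquad (16)$$

by LI. This last expectation is nondecreasing in $\bar{X}_{iJ}$ by M and Lemma 3.1, using a result of
Lehmann (1955).

(b) Let $\theta^{(1)} < \theta^{(2)}$; then there exist sequences $\alpha_J^{(1)} < \beta_J^{(1)}$ and $\alpha_J^{(2)} < \beta_J^{(2)}$, with
$\beta_J^{(1)} < \alpha_J^{(2)}$ for all large $J$, such that
$E[X_i \mid \alpha_J^{(k)} < \bar{X}_{iJ} < \beta_J^{(k)}] \to E[X_i \mid \Theta = \theta^{(k)}]$ for $k = 1, 2$, from Lemma 2.1.
Then

$$E[X_i \mid \Theta = \theta^{(1)}] = \lim_{J \to \infty} E[X_i \mid \alpha_J^{(1)} < \bar{X}_{iJ} < \beta_J^{(1)}] \le \lim_{J \to \infty} E[X_i \mid \alpha_J^{(2)} < \bar{X}_{iJ} < \beta_J^{(2)}] = E[X_i \mid \Theta = \theta^{(2)}]$$

where the middle inequality follows from the fact that

$$E[X_i \mid \alpha_J^{(1)} < \bar{X}_{iJ} < \beta_J^{(1)}] = \frac{\sum_{c \in (\alpha_J^{(1)}, \beta_J^{(1)})} E[X_i \mid \bar{X}_{iJ} = c]\, P[\bar{X}_{iJ} = c]}{\sum_{c \in (\alpha_J^{(1)}, \beta_J^{(1)})} P[\bar{X}_{iJ} = c]} \le \frac{\sum_{c \in (\alpha_J^{(2)}, \beta_J^{(2)})} E[X_i \mid \bar{X}_{iJ} = c]\, P[\bar{X}_{iJ} = c]}{\sum_{c \in (\alpha_J^{(2)}, \beta_J^{(2)})} P[\bar{X}_{iJ} = c]} = E[X_i \mid \alpha_J^{(2)} < \bar{X}_{iJ} < \beta_J^{(2)}]$$

(under MM, the second ratio of sums above is a weighted average of larger conditional
probabilities than the first one). □

Corollary 3.1 Under LI, LAD and (5) we have

$$\text{MM} \Leftrightarrow \text{M}$$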




4 Characterization of $d_L = 1$

The  previous  two  sections  may  be  summarized  in  the  following  theorem. 


Theorem 4.1 If $\underline{X}$ is an item sequence satisfying $d_L = 1$ and if LAD holds with respect to
the latent trait $\Theta$, then each of the conditions CA, $d_E = 1$, LCSN and MM holds.

We show in this section that the converse is also true: the four conditions CA, $d_E = 1$,
LCSN and MM guarantee a useful $d_L = 1$ representation; hence these conditions characterize
useful $d_L = 1$ representations. Moreover the converse implication is still true if LCSN is
replaced with its manifest analogue condition CSN.

The main problem with obtaining these two converses of Theorem 4.1 is that our negative
covariance criteria CSN and LCSN use conditioning on fixed values of $\bar{X}_J$ only, whereas
Lemma 2.1 gives approximations to (8) and (9) which require conditioning on intervals
$\alpha_J < \bar{X}_J < \beta_J$. The next lemma connects these two forms of conditioning. We will assume
that for each $J$ and $i \le J$ there exist differentiable $g_{iJ}$ such that

$$E[X_i \mid \bar{X}_J] = g_{iJ}(\bar{X}_J), \qquad \sup_{i,J,u} |g'_{iJ}(u)| \le M < \infty \qquad (17)$$

and that for each $J$, $i \le J$, and $\theta$ there exist differentiable $g_{iJ\theta}$ such that

$$E[X_i \mid \bar{X}_J, \Theta = \theta] = g_{iJ\theta}(\bar{X}_J), \qquad \sup_{i,J,u} |g'_{iJ\theta}(u)| \le M_\theta < \infty. \qquad (18)$$

The conditions (17) and (18) are maximum discrimination conditions on item-test
regressions; most likely they would be acceptable in practice. In particular, note that (17) and
(18) do not hide a monotonicity assumption.

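Lemma 4.1 (a) Suppose CSN holds, and suppose (17) also holds. Then for any constants
$\alpha_J < \beta_J$ for which the covariances are defined, and for which $\beta_J - \alpha_J \to 0$,

$$\limsup_{J \to \infty} \mathrm{Cov}(X_i, X_j \mid \alpha_J < \bar{X}_J < \beta_J) \le 0. \qquad (19)$$

(b) Suppose LCSN holds, and suppose (18) also holds. Then for any constants $\alpha_J < \beta_J$
for which the covariances are defined, and for which $\beta_J - \alpha_J \to 0$,

$$\limsup_{J \to \infty} \mathrm{Cov}(X_i, X_j \mid \alpha_J < \bar{X}_J < \beta_J, \theta) \le 0. \qquad (20)$$

proof. We will do part (a) only; part (b) is virtually identical. We have

$$\mathrm{Cov}(X_i, X_j \mid \alpha_J < \bar{X}_J < \beta_J) = E[\mathrm{Cov}(X_i, X_j \mid \bar{X}_J) \mid \alpha_J < \bar{X}_J < \beta_J] + \mathrm{Cov}\big[E[X_i \mid \bar{X}_J], E[X_j \mid \bar{X}_J] \,\big|\, \alpha_J < \bar{X}_J < \beta_J\big]. \qquad (21)$$

The first term on the right is evidently nonpositive for $J$ large. Let us drop the conditioning
on $\alpha_J < \bar{X}_J < \beta_J$ from the notation for brevity; then the second term in (21) is

$$\mathrm{Cov}\big[E[X_i \mid \bar{X}_J], E[X_j \mid \bar{X}_J]\big] = \mathrm{Cov}[g_{iJ}(\bar{X}_J), g_{jJ}(\bar{X}_J)] \le \{\mathrm{Var}\, g_{iJ}(\bar{X}_J) \cdot \mathrm{Var}\, g_{jJ}(\bar{X}_J)\}^{1/2},$$

by the Cauchy-Schwarz inequality. Now applying Taylor’s theorem,

$$\mathrm{Var}\, g_{iJ}(\bar{X}_J) \le [\max_u |g'_{iJ}(u)|]^2 \cdot \mathrm{Var}\, \bar{X}_J$$

so that conditioning on $\alpha_J < \bar{X}_J < \beta_J$, which forces $\mathrm{Var}\, \bar{X}_J \to 0$ and hence $\mathrm{Var}\, g_{iJ}(\bar{X}_J) \to 0$
as $J \to \infty$, also forces the second term in (21) to go to zero, completing the proof. □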
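The proof rests on splitting a covariance, conditional on a score interval, into a within-score part and a between-score part. A minimal sketch of that split on a finite distribution follows; the three-item probability table and the function names are hypothetical, chosen only to make the identity checkable.

```python
from itertools import product

def cond_moments(rows):
    """Return (E[X], E[Y], Cov(X, Y)) for weighted rows (x, y, z, p)."""
    mass = sum(p for _, _, _, p in rows)
    ex = sum(x * p for x, _, _, p in rows) / mass
    ey = sum(y * p for _, y, _, p in rows) / mass
    exy = sum(x * y * p for x, y, _, p in rows) / mass
    return ex, ey, exy - ex * ey

def decompose(rows, lo, hi):
    """Split Cov(X, Y | lo < Z < hi) into within-Z and between-Z parts."""
    window = [r for r in rows if lo < r[2] < hi]
    mass = sum(r[3] for r in window)
    total = cond_moments(window)[2]
    within = 0.0
    means = []
    for z in sorted({r[2] for r in window}):
        cell = [r for r in window if r[2] == z]
        ex, ey, cov = cond_moments(cell)
        pz = sum(r[3] for r in cell) / mass
        within += pz * cov              # E[Cov(X, Y | Z) | window]
        means.append((ex, ey, z, pz))   # conditional means, weighted by P[Z = z | window]
    between = cond_moments(means)[2]    # Cov(E[X | Z], E[Y | Z] | window)
    return total, within, between

# Hypothetical joint distribution of three dichotomous items; X = item 1,
# Y = item 2, Z = mean score of the three items.
probs = [0.10, 0.05, 0.15, 0.10, 0.10, 0.10, 0.15, 0.25]
rows = [(x[0], x[1], sum(x) / 3, p)
        for x, p in zip(product((0, 1), repeat=3), probs)]

total, within, between = decompose(rows, 0.2, 0.8)
print(total, within, between)
```

The first returned value should equal the sum of the other two up to rounding, which is the identity used at the start of the proof.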

Now we are ready to state and prove the two converses to Theorem 4.1. The formal
statement of our characterization of useful $d_L = 1$ models is

Theorem 4.2 Suppose $\underline{X}$ is an item sequence and $\Theta$ is a unidimensional trait, and suppose
(5), (17) and (18) hold. Then

(a) CA, $d_E = 1$, LCSN, MM $\Leftrightarrow$ $d_L = 1$, LAD

(b) CA, $d_E = 1$, CSN, MM $\Rightarrow$ $d_L = 1$, LAD

Remarks. In the implications “$\Rightarrow$” in (a) and (b), $\Theta$ is the trait with respect to which
$d_E = 1$ holds, and the theorem asserts that in fact $d_L = 1$ holds with respect to this $\Theta$. In
the implication “$\Leftarrow$” in (a), $\Theta$ is the trait with respect to which $d_L = 1$ holds. In both cases,
$\Theta$ is unique up to monotone transformation (Stout, 1990, Theorem 3.3).




proof. It is more convenient to prove (b) first.

Part (b), $\Rightarrow$. There are three conditions to check on the right: LI, M, and LAD.
LAD follows from $d_E = 1$ by definition. M follows from MM and $d_E = 1$ via
Proposition 3.1(b). LI follows from CA, $d_E = 1$ and CSN, using Proposition 2.1, Proposition 2.2,
and Lemma 4.1(a), since (19) implies that under CSN and $d_E = 1$ we have $\mathrm{Cov}(X_i, X_j \mid \theta) \le 0$
for all $i$, $j$ and $\theta$.

Part (a), $\Rightarrow$. Again we must check LI, M and LAD. LAD and M follow as before.
LI follows again, using Proposition 2.1, Proposition 2.2, and Lemma 4.1(b), since now (20)
implies that under LCSN $\mathrm{Cov}(X_i, X_j \mid \theta) \le 0$ for all $i$, $j$, and $\theta$ (a conditional [given $\Theta = \theta$]
form of Lemma 2.1 is needed to show this, but this is straightforward).

Part (a), $\Leftarrow$. This is Theorem 4.1, but we state the proof for completeness. We must
check MM, CA, EI, LAD and LCSN. MM and CA follow from $d_L = 1$ by Proposition 3.1(a)
and Theorem 2.1, respectively. EI follows from LI trivially, LAD is assumed on the right,
and LCSN follows from LI via a conditional form of Theorem 2.8 of Joag-Dev and Proschan
(1982). □

Theorem 4.2(a) gives a complete characterization of $d_L = 1$ representations among
“smooth” representations satisfying the mild monotonicity condition LAD; this is
essentially the class of useful $d_L = 1$ representations, assuming that the distribution of $\Theta$ is not
concentrated at one point. The characterization is of interest because the conditions MM,
CA and EI are all conditions on the manifest structure $P[\underline{X}_J = \underline{x}_J]$ of $\underline{X}$. LCSN is not itself
a “manifest” condition, but it appears to be quite close to its natural manifest structure
analogue CSN in practice. Theorem 4.2(b) gives reasonably general conditions on the
manifest structure of $\underline{X}$ which are sufficient to guarantee a “useful” $d_L = 1$ representation. Note
that overly restrictive assumptions, such as detailed knowledge of the forms of the ICC's or
of the distribution of $\Theta$, are not needed in this approach.

It is important to point out that Theorem 4.2 is possible only through the use of an
infinite item sequence $\underline{X}$. The constraints put on the latent structure of a finite set of items
$\underline{X}_J = (X_1, \ldots, X_J)$ by the distribution of $\underline{X}_J$ are not strong enough, in general, to guarantee
a particular form for the latent structure. The principal difference between the finite case
$\underline{X}_J$ and the infinite case $\underline{X}$ is that, whereas a latent trait may be only imperfectly estimated
using $\underline{X}_J$, it can be known with complete accuracy using $\underline{X}$ (see Levine, 1985, for a related
discussion about the limits of our ability to know $\Theta$ from $\underline{X}_J$ for finite $J$). Thus the use
of (conceptually) infinite sequences of items seems absolutely vital to clarify model-building
and model-identification issues (see also Stout, 1990, for a discussion of this point).

5  Discussion 

In this paper we have combined the conditional association (CA) approach of Holland and
Rosenbaum and the essential unidimensionality ($d_E = 1$) approach of Stout to produce a
nearly complete nonparametric characterization of useful $d_L = 1$ representations for
dichotomous IRT data. The three principles U1, U2 and U3 for a useful representation require
that the latent trait $\Theta$ can be consistently estimated from the item responses; that $\Theta$ is
monotonically related to the test’s “true score”; and that $\Theta$ is not constant in the examinee
population.

In Section 2 we reviewed the CA and $d_E = 1$ conditions. A crucial feature of our
analysis was the embedding of the finite-length test $\underline{X}_J$ into an infinite sequence of items
$\underline{X}$ which extends the features of the observed set of items $\underline{X}_J$. This embedding is needed
for two reasons: first, several authors have observed that estimation of traits to arbitrary
precision, and hence identification of latent structure, cannot be expected to work unless
the test length is allowed to grow without bound; and second, the embedding is needed to
discuss Stout’s notions of essential unidimensionality. Moreover in many practical settings,
the embedding can be justified as a continuation of the usual process used to manufacture
items (cf. Stout, 1990). Under CA, all three definitions of $d_E = 1$ proposed by Stout are
equivalent, and CA and $d_E = 1$ together bring us close to a $d_L = 1$ representation.

In Section 3 we developed two further conditions which seem to be needed to obtain a
characterization of strict unidimensionality. First, the negative covariance condition CSN
($\mathrm{Cov}(X_i, X_j \mid \bar{X}_J) \le 0$) is a natural one to add to CA and $d_E = 1$ to ensure that the items
are locally independent with respect to $\Theta$. CSN is guaranteed to be true in the Rasch
model, and a simulation in Appendix C shows that CSN is also plausible in two-parameter
and three-parameter logistic models. Second, monotonicity of the empirical ICC’s
$P[X_j = 1 \mid \bar{X}_J - X_j/J]$ is intimately related to ICC monotonicity: this “manifest monotonicity” must
hold if $d_L = 1$ holds; and conversely it can be used to verify monotonicity of the usual ICC's
$P_j(\theta)$ when $d_E = 1$ holds.

In Section 4 we showed that useful $d_L = 1$ representations may be characterized by the
conditions CA, $d_E = 1$, manifest monotonicity, and a local version of CSN. If the local CSN
condition is replaced with CSN itself, we obtain a fairly general set of sufficient conditions
on the manifest structure $P[\underline{X}_J = \underline{x}_J]$ as $J \to \infty$, for a useful $d_L = 1$ representation to
hold. These conditions have the agreeable property that parametric forms of the ICC's are
not needed to check $d_L = 1$.

It  is  important  to  note  that  some  form  of  monotonicity  or  ICC  smoothness  is  needed  to 
avoid  meaningless  models;  an  example  in  Appendix  A,  due  to  Suppes  and  Zanotti  (1981), 
illustrates  this.  Our  preferred  “nonparametric”  condition  has  been  Stout’s  local  asymptotic 
discrimination  LAD  condition.  In  parametric  settings,  a  general  monotonicity  condition  such 
as  LAD  might  be  dropped  in  the  face  of  other  smoothness  available  from  the  parametric  form 
of  the  model.  However  such  a  condition  is  often  plausible,  even  if  the  ICC’s  are  not  monotone, 
and greatly enhances the interpretability of the model.

The results of this paper bring us quite close to a characterization of useful $d_L = 1$
representations solely in terms of the distribution of the manifest item response data. This is
valuable for two reasons. First, it is hoped that presenting them stimulates further discussion
of the basic components of IRT modeling. For example, the local form of CSN in the
characterization above suggests that both a positive covariance condition like CA, and a
negative covariance condition like CSN seem needed to properly understand the general
$d_L = 1$ assumptions. Second, such a characterization suggests practical nonparametric
tests of fit for the general $d_L = 1$ representation. But the practical application of the
characterization theorem requires one to deal with a potentially difficult multiple inference
problem: testing CSN involves combining $J \cdot \binom{J}{2}$ statistical tests, and CA involves even
more statistical tests. Understanding this multiple-inference problem and exploiting possible
dependencies among the tests to reduce the problem is an important future step in this
research.




Acknowledgements 


A preliminary version of this material appeared as Chapter 4 of the author’s Ph.D.
dissertation. This work was greatly facilitated by enjoyable and stimulating discussions with Bill
Stout.  Comments  on  parts  of  this  work  by  Bertrand  Clarke  and  Paul  Holland  were  very 
much  appreciated.  Finally,  the  author  is  grateful  to  Ivo  Molenaar  for  allowing  the  examples 
in  Appendix  B  to  appear  in  this  report. 


A  The  “necessity”  of  LI,  M  and  D 


The assumptions LI, M and D can be (and sometimes are) weakened but not dropped: if
any one of them is completely dropped, the resulting version of (1) can be made to fit any
distribution of dichotomous items. Two of the examples which show this are known in the
literature, but we repeat them here for completeness. Example A.3 appears to be new.

Example A.1 $d = 1$ and LI hold, but M dropped. (Suppes and Zanotti, 1981). Here
we allow the ICC’s to be arbitrarily rough and nonmonotone; our goal is to represent an
arbitrary distribution $P[\underline{X}_J = \underline{x}_J]$ as in (1). Let $\Theta = \sum_{j=1}^{J} X_j 2^{-j}$, i.e. $\Theta$ is the base-two
fraction $0.X_1 X_2 \ldots X_J$. Then the ICC’s may be written as

$$P[X_j = 1 \mid \theta] = 1_{\{\mathrm{Int}(2^j \theta) \bmod 2 = 1\}}$$

where “Int$(t)$” is the integer part of $t$, and $1_C$ equals 1 when $C$ is true and 0 when $C$ is false.
The likelihood may be written as usual as under LI, and the distribution of $\Theta$ is described
by a probability mass function

$$f(\theta) = \begin{cases} P[\underline{X}_J = \underline{x}_J], & \text{if } \theta = \sum_{j=1}^{J} x_j 2^{-j}; \\ 0, & \text{otherwise.} \end{cases}$$

Finally, (1) becomes

$$P[\underline{X}_J = \underline{x}_J] = \sum_{\theta} \prod_{j=1}^{J} P_j(\theta)^{x_j} [1 - P_j(\theta)]^{1 - x_j} f(\theta).$$

This example can be modified to allow infinitely many items, items with more than two
response categories, and items with continuous responses. □
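The construction in Example A.1 is easy to reproduce for a short test. The sketch below encodes each response pattern as a base-two fraction, uses the digit-extracting indicator as the ICC, and rebuilds an arbitrary pattern distribution from the locally independent form. The three-item distribution `dist` and the function names are hypothetical, introduced only for the demonstration.

```python
from fractions import Fraction
from itertools import product

def encode(x):
    """Map a pattern (x_1, ..., x_J) to the base-two fraction 0.x_1 x_2 ... x_J."""
    return sum(Fraction(xj, 2 ** (j + 1)) for j, xj in enumerate(x))

def icc(theta, j):
    """Indicator ICC for item j (1-based): the j-th binary digit of theta."""
    return int(theta * 2 ** j) % 2

def represent(dist):
    """Rebuild P[X = x] as sum_theta prod_j P_j^x_j (1 - P_j)^(1 - x_j) f(theta)."""
    f = {encode(x): p for x, p in dist.items()}   # pmf of theta
    rebuilt = {}
    for x in dist:
        total = Fraction(0)
        for theta, mass in f.items():
            like = 1
            for j, xj in enumerate(x, start=1):
                pj = icc(theta, j)
                like *= pj if xj else 1 - pj
            total += like * mass
        rebuilt[x] = total
    return rebuilt

# Hypothetical, deliberately arbitrary distribution over the 8 patterns of 3 items.
patterns = list(product((0, 1), repeat=3))
dist = {x: Fraction(k + 1, 36) for k, x in enumerate(patterns)}

rebuilt = represent(dist)
second_item_icc = [icc(Fraction(k, 4), 2) for k in range(4)]
print(rebuilt == dist, second_item_icc)
```

The list `second_item_icc` evaluates the ICC of item 2 at 0, 1/4, 1/2 and 3/4, which makes its non-monotone, oscillating shape visible.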




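Example A.2 LI and M hold, but $d = J$. (Holland and Rosenbaum, 1986; Stout, 1990).
To write the representation (1) in this setting, we put $\boldsymbol{\Theta} = \underline{X}_J$, so that each component
$\Theta_j = X_j$ records exactly the response to the $j$th item. The ICC’s may be written as

$$P[X_j = 1 \mid \boldsymbol{\theta}] = \theta_j,$$

which are certainly coordinatewise nondecreasing in $\boldsymbol{\theta}$; the multivariate distribution of $\boldsymbol{\Theta}$ is
exactly the same as for $\underline{X}_J$,

$$f(\boldsymbol{\theta}) = P[\boldsymbol{\Theta} = \boldsymbol{\theta}] = P[\underline{X}_J = \boldsymbol{\theta}].$$

Here, (1) becomes

$$P[\underline{X}_J = \underline{x}_J] = \sum_{\boldsymbol{\theta}} \prod_{j=1}^{J} P_j(\boldsymbol{\theta})^{x_j} [1 - P_j(\boldsymbol{\theta})]^{1 - x_j} f(\boldsymbol{\theta}).$$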

Example A.3 $d = 1$ and M hold, but LI dropped. Here we set $\Theta = \prod_j X_j$. The ICC’s


P[X,  =  1|0]  = 


1,  if  0  =  1; 
0,  if  0  =  0 


remain  monotone,  but  the  joint  likelihood  cannot  be  written  as  under  LI.  Instead, 


P[Kj  =  Xj], 
P[Kj  =  Xj], 
0, 


if  0  =  0  and  Xj  < 
if  0  =  1  and  Yli  X;  =  J', 
otherwise 


and  the  probability  mass  function  for  0  is 


f{0)  =  p[Y:ix,  =  jYPizix,  < 


Finally,  the  representation  (1)  is  simply  written  as 


e=o 

It is worth remarking that each of these examples is very much degenerate, as a piece of
psychometric modeling. In Example A.1 and Example A.2 the latent variables chosen are
not latent at all, in that they are completely determined by the examinee’s responses (and
thus represent no broader sense of a psychological construct than the examinee’s responses
themselves). In Example A.3 the problem is the “opposite”: here $\Theta$ captures virtually nothing
about the examinee’s response behavior. The examples should be taken as indications of
how quickly the modeling process can devolve if the assumptions LI, M and D are weakened
too much.


B  Limits  on  generalizing  MM 


Molenaar (priv. comm.) has independently discovered Proposition 3.1(a), and reports
counterexamples indicating the limitations to extensions of this result. Example B.1 shows that
the method of proof of Lemma 3.1 cannot be extended to the polytomous case.
Example B.2 shows that, in Proposition 3.1(a), we cannot replace the “delete average”
$\bar{X}_{iJ} = \bar{X}_J - X_i/J$ with the more natural average over all items $\bar{X}_J$.


Example B.1 (I. Molenaar). The monotone likelihood ratio property (15) of Grayson
(1988) does not extend to polytomous items. Let $0 \le \theta \le 1$ and consider a single
graded-response item $X$ taking the three values 0, 1 or 2, with


P[X>l\e]  = 


P[X  >  2\6]  = 


I  30,  0  <  0  <  1/4 

I  I  +  i0,  1/4  <  0  <  1 
0,  0  <  0  <  1/4 

<  3(0 -i),  l/4<0<l/2 

,  i  -k  10,  1/2  <  0  <  1 


In this case, for $\theta_0 = 1/4$ and $\theta_1 = 1/2$, the likelihood ratio

$$\frac{P[X = x \mid \theta_0]}{P[X = x \mid \theta_1]}$$

is not monotone in $x$. For one may calculate

                      θ_0 = 1/4    θ_1 = 1/2    Ratio
    P[X = 0 | θ]      1/4          1/8          2
    P[X = 1 | θ]      3/4          1/56         42
    P[X = 2 | θ]      0            6/7          0            (22)




Here  the  ratio  (22)  is  not  monotone  in  x.  A  consequence  of  (15)  in  the  proof  of  Lemma  3.1 
would  be  that  the  likelihood  ratio  (22)  is  monotone  in  x;  since  it  is  not,  the  proof  of  Lemma  3.1 
cannot  be  extended  to  the  polytomous  case.  □ 
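The arithmetic behind the ratio column of (22) can be rechecked with exact fractions. This is a small sketch using only the six tabulated probabilities; the variable names are illustrative.

```python
from fractions import Fraction as F

p0 = [F(1, 4), F(3, 4), F(0)]        # P[X = x | theta_0 = 1/4] for x = 0, 1, 2
p1 = [F(1, 8), F(1, 56), F(6, 7)]    # P[X = x | theta_1 = 1/2] for x = 0, 1, 2

ratios = [a / b for a, b in zip(p0, p1)]
# A monotone sequence equals its own sorted copy in one direction or the other.
is_monotone = ratios == sorted(ratios) or ratios == sorted(ratios, reverse=True)
print(ratios, is_monotone)
```

Both columns sum to one, and the ratio rises from the first category to the second before falling to zero at the third, so it is not monotone in $x$.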


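Example B.2 (T. Snijder). $P[X_i = 1 \mid \bar{X}_J = s]$ need not increase with $s$, and hence we
may not replace $\bar{X}_{iJ}$ with $\bar{X}_J$ in Proposition 3.1(a). Consider three dichotomous items, and
a two-point distribution for $\Theta$, $P(\Theta = \theta_0) = P(\Theta = \theta_1) = \frac{1}{2}$. Let

$$P_i(\theta_0) = \epsilon,\ i = 1, 2, 3; \qquad P_1(\theta_1) = \tfrac{1}{2}; \qquad P_2(\theta_1) = 1 - \epsilon; \quad \text{and} \quad P_3(\theta_1) = 1 - \epsilon.$$

It follows that, as $\epsilon \to 0$,

$$P[X_1 = 1 \mid \bar{X}_J = 1/3] = \frac{(1 - \epsilon)^2 + \frac{1}{2}\epsilon}{3(1 - \epsilon)^2 + 1 - \frac{1}{2}\epsilon} \to \frac{1}{4}$$

$$P[X_1 = 1 \mid \bar{X}_J = 2/3] = \frac{2\epsilon^2 + \epsilon}{3\epsilon^2 + \frac{1}{2}\epsilon + \frac{1}{2}} \to 0.$$

Hence for small $\epsilon > 0$, $P[X_1 = 1 \mid \bar{X}_J = 1/3] > P[X_1 = 1 \mid \bar{X}_J = 2/3]$. □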
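The two conditional probabilities in Example B.2 can be computed exactly by enumerating the eight response patterns. The following sketch does so with rational arithmetic; the function name and the trial value of $\epsilon$ are choices made here for illustration, and the raw total score stands in for $\bar{X}_J$ (a total of 1 or 2 corresponds to $\bar{X}_J = 1/3$ or $2/3$).

```python
from fractions import Fraction as F
from itertools import product

def prob_item1_given_score(eps, score):
    """P[X_1 = 1 | X_1 + X_2 + X_3 = score] in the two-point model of Example B.2."""
    half = F(1, 2)
    iccs = {                      # P_i(theta) at the two support points
        "theta0": [eps, eps, eps],
        "theta1": [half, 1 - eps, 1 - eps],
    }
    num = den = F(0)
    for x in product((0, 1), repeat=3):
        if sum(x) != score:
            continue
        mass = F(0)
        for ps in iccs.values():
            like = F(1)
            for xi, p in zip(x, ps):
                like *= p if xi else 1 - p
            mass += half * like   # each support point has prior weight 1/2
        den += mass
        if x[0] == 1:
            num += mass
    return num / den

eps = F(1, 100)
low = prob_item1_given_score(eps, 1)
high = prob_item1_given_score(eps, 2)
print(float(low), float(high))
```

For small $\epsilon$ the first value is close to 1/4 and the second is close to 0, so the item-test regression on the full average decreases between these two score levels.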


C  An  illustration  of  CSN 


We have argued in Section 3.2 that the CSN condition

$$\mathrm{Cov}(X_i, X_j \mid \bar{X}_J) \le 0$$

is a reasonable one to look for in dichotomous test data, to ensure that the local (given
$\Theta = \theta$) association between items is not too strong, under CA, for LI to hold.

To illustrate the plausibility of CSN in common unidimensional IRT models, a small simulation
study was performed. Item characteristic curves were taken to be of the logistic form

$$P_j(\theta) = c_j + (1 - c_j)\,\frac{1}{1 + \exp\{-1.7\,a_j(\theta - b_j)\}} \tag{23}$$




All three of the popular ICC models based on (23) were considered: the Rasch model (one
parameter logistic, 1PL) in which $c_j = 0$ and $a_j = 1$; the two parameter logistic model (2PL)
in which $c_j = 0$; and the three parameter logistic (3PL) model in which all parameters are
free. Free parameters were generated as appropriate for each model as follows.

• $a_j = 0.5 + 0.5\,(j \bmod 3) + \alpha_j$, where $\alpha_j$ is $N(0, 0.0625)$ [normal, mean zero, variance
0.0625] noise, truncated so that $0.5 \le a_j \le 1.5$;

•  bj  =  —2.0  +  4.035y  +  where  /?_,  is  A^(0,  jj)  noise; 

• $c_j = 0.2 + \gamma_j$, where $\gamma_j$ is $N(0, 0.01)$ noise, truncated so that $0.0 \le c_j \le 0.4$.

For each of the three models, a 2000-examinee test administration was simulated, for varying
test lengths $J$. Examinee abilities were sampled from a $N(0, 1)$ distribution and responses
were generated according to the corresponding locally independent IRT model.
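The simulation design just described can be sketched as follows. This is an illustration under stated assumptions, not the code used for the study: truncation is implemented by clipping, the difficulty rule for $b_j$ is a hypothetical placeholder (difficulties spread over roughly $[-2, 2]$ with small noise), and the function names and seed are invented for the example.

```python
import numpy as np

def icc(theta, a, b, c):
    """Logistic ICC of form (23): c + (1 - c) / (1 + exp(-1.7 a (theta - b)))."""
    theta = np.atleast_1d(np.asarray(theta, dtype=float))[:, None]
    return c + (1.0 - c) / (1.0 + np.exp(-1.7 * a * (theta - b)))

def draw_item_parameters(J, model, rng):
    """Draw (a, b, c) for J items; model is '1PL', '2PL' or '3PL'."""
    if model not in ("1PL", "2PL", "3PL"):
        raise ValueError("model must be '1PL', '2PL' or '3PL'")
    j = np.arange(1, J + 1)
    # a_j = 0.5 + 0.5 (j mod 3) + N(0, 0.0625) noise; clipping to [0.5, 1.5]
    # is one assumed way to carry out the truncation.
    a = np.clip(0.5 + 0.5 * (j % 3) + rng.normal(0.0, 0.25, J), 0.5, 1.5)
    # HYPOTHETICAL difficulty rule, used only so the sketch can run.
    b = -2.0 + 4.0 * j / J + rng.normal(0.0, 0.25, J)
    # c_j = 0.2 + N(0, 0.01) noise, clipped to [0.0, 0.4].
    c = np.clip(0.2 + rng.normal(0.0, 0.1, J), 0.0, 0.4)
    if model == "1PL":
        a = np.ones(J)          # Rasch: a_j = 1
    if model in ("1PL", "2PL"):
        c = np.zeros(J)         # no lower asymptote
    return a, b, c

def simulate_responses(J, model, n_examinees=2000, seed=0):
    """Simulate an n_examinees x J matrix of locally independent 0/1 responses."""
    rng = np.random.default_rng(seed)
    a, b, c = draw_item_parameters(J, model, rng)
    theta = rng.standard_normal(n_examinees)      # abilities ~ N(0, 1)
    prob = icc(theta, a, b, c)
    return (rng.random(prob.shape) < prob).astype(int)

X = simulate_responses(J=20, model="2PL")
print(X.shape, X.mean())
```

Local independence enters through the last step: given each examinee's ability, every response is drawn as an independent Bernoulli variable with success probability given by the ICC.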

The Mantel-Haenszel (M-H) one-sided z-test for

$$H_0: \mathrm{Cov}(X_i, X_j \mid \bar X_J) \le 0 \quad \text{vs.} \quad H_1: \mathrm{Cov}(X_i, X_j \mid \bar X_J) > 0,$$

combined across $\bar X_J$ categories, was performed for each pair $i, j$. The number of tests for
which the Mantel-Haenszel $z$ exceeds 1.28 (corresponding to a nominal level $p < 0.10$) was
tallied, and these particular tests are displayed in detail.
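A Mantel-Haenszel statistic of this kind can be computed by stratifying examinees on the total score and pooling the $2 \times 2$ tables for a pair of items. The sketch below uses the standard hypergeometric mean and variance without a continuity correction; whether the original analysis applied a correction is not stated, so that choice is an assumption, as is the function name.

```python
import numpy as np

def mantel_haenszel(xi, xj, strata):
    """Return (z, alpha_mh) for two 0/1 item vectors, pooled over strata.

    z is the one-sided Mantel-Haenszel statistic (no continuity correction);
    alpha_mh is the Mantel-Haenszel estimate of the common odds ratio.
    """
    xi, xj, strata = (np.asarray(v) for v in (xi, xj, strata))
    observed = expected = variance = 0.0
    num = den = 0.0
    for s in np.unique(strata):
        m = strata == s
        n = float(m.sum())
        if n < 2:
            continue  # a single-examinee stratum carries no information
        a = float(np.sum((xi[m] == 1) & (xj[m] == 1)))
        b = float(np.sum((xi[m] == 1) & (xj[m] == 0)))
        c = float(np.sum((xi[m] == 0) & (xj[m] == 1)))
        d = float(np.sum((xi[m] == 0) & (xj[m] == 0)))
        row1, row0, col1, col0 = a + b, c + d, a + c, b + d
        observed += a
        expected += row1 * col1 / n
        variance += row1 * row0 * col1 * col0 / (n * n * (n - 1.0))
        num += a * d / n
        den += b * c / n
    z = (observed - expected) / np.sqrt(variance) if variance > 0 else float("nan")
    alpha_mh = num / den if den > 0 else float("inf")
    return z, alpha_mh

# Tiny single-stratum illustration with a positive association.
xi = [1, 1, 1, 1, 0, 0, 0, 0]
xj = [1, 1, 1, 0, 1, 0, 0, 0]
z, alpha_mh = mantel_haenszel(xi, xj, strata=[0] * 8)
print(z, alpha_mh)
```

For a response matrix `X` with examinees in rows, the strata for a pair of columns would be the row sums of `X`, which carry the same information as the categories of the item average.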

Note that, since $\binom{J}{2}$ statistical tests are performed (one for each pair $(i, j)$), even if
$H_0$ is true, one would normally expect some significantly positive covariances because of
"capitalization on chance." Hence there is a severe multiple inference problem, paralleling
a similar problem with the CA condition. Understanding this multiple-inference problem
and exploiting possible dependencies among the tests to reduce the problem is an important
future step in this research.
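To give a sense of scale for the multiple inference problem, the following sketch counts the tests for each test length and multiplies by the nominal level. It is only a rough yardstick: it assumes every test rejects with probability exactly 0.10, which corresponds to the boundary of $H_0$, and it ignores dependence among the tests.

```python
from math import comb

def expected_exceedances(J, level=0.10):
    """Number of item pairs and the expected count of z's above the cutoff."""
    n_tests = comb(J, 2)
    return n_tests, level * n_tests

for J in (10, 20, 40, 80):
    n_tests, expected = expected_exceedances(J)
    print(f"J={J:3d}  tests={n_tests:5d}  expected exceedances={expected:.1f}")
```

When the conditional covariances are strictly negative rather than zero, the rejection probability of each test falls below the nominal level, so observed counts well under these figures are not surprising.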

The results for the Rasch model are displayed in Table 1 and Table 2. Since LCSN and
CSN are equivalent under the Rasch model, $H_0$ is always true; hence the nine large M-H z's
in Table 1 are exclusively due to Type I error (capitalization on chance). Hence Table 1 may
be taken as a baseline: if in another model the number of large M-H z's is smaller than for
the Rasch model, we may be confident in $H_0$, and hence (13).




The results for the 2PL simulation are displayed in Tables 3 and 4, and for the 3PL
simulation in Tables 5 and 6. Remarkably, fewer large M-H z's were found for these models,
where the equivalence of LCSN and CSN is not known theoretically: only two large M-H z's
were found for the 2PL simulation, and four were found for the 3PL simulation. Moreover,
there are no large M-H z's for the longer tests, suggesting the validity of (12) as $J \to \infty$.


  J     # M-H tests    # M-H z's above 1.28
  10         45                  0
  20        190                  0
  40        780                  9
  80       3160                  0

Table 1: Number of large positive associations for 2000-examinee Rasch simulation ($J$ = test
length).


  i     j     M-H z    Nominal p    $\hat\alpha_{MH}$    $\ln(\hat\alpha_{MH})$
  32    7     1.403      0.080           1.634                 0.491
  36    1     1.779      0.038           3.172                 1.154
  36    8     1.434      0.076           1.640                 0.495
  37    5     1.497      0.067           2.142                 0.762
  37    7     2.401      0.008           6.083                 1.805
  37    27    1.347      0.089           1.295                 0.259
  38    7     1.502      0.067           2.539                 0.932
  39    6     1.408      0.080           1.196                 0.179
  40          1.503      0.066           3.343                 1.207

Table 2: Details for 40-item Rasch simulation ($\hat\alpha_{MH}$ = est. common odds ratio across $\bar X_J$
categories).
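The columns of Table 2 are linked by two simple relations: the nominal p is the upper normal tail area beyond the M-H z, and the last column is the natural logarithm of the estimated common odds ratio. A minimal check of the printed values, using only the standard library (the variable names are illustrative), might look like this:

```python
from math import erfc, log, sqrt

# (M-H z, nominal p, estimated odds ratio, log odds ratio) as printed in Table 2.
TABLE_2 = [
    (1.403, 0.080, 1.634, 0.491),
    (1.779, 0.038, 3.172, 1.154),
    (1.434, 0.076, 1.640, 0.495),
    (1.497, 0.067, 2.142, 0.762),
    (2.401, 0.008, 6.083, 1.805),
    (1.347, 0.089, 1.295, 0.259),
    (1.502, 0.067, 2.539, 0.932),
    (1.408, 0.080, 1.196, 0.179),
    (1.503, 0.066, 3.343, 1.207),
]

def upper_tail(z):
    """One-sided standard normal tail probability P[Z > z]."""
    return 0.5 * erfc(z / sqrt(2.0))

for z, p, alpha, log_alpha in TABLE_2:
    print(f"z={z:.3f}  tail={upper_tail(z):.3f} (printed {p:.3f})  "
          f"ln(alpha)={log(alpha):.3f} (printed {log_alpha:.3f})")

# The cutoff 1.28 corresponds to a one-sided level of about 0.10.
print(f"P[Z > 1.28] = {upper_tail(1.28):.4f}")
```

The same two relations can be used to sanity-check entries in the other detail tables.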




  J     # M-H tests    # M-H z's above 1.28
  10         45                  1
  20        190                  1
  40        780                  0
  80       3160                  0

Table 3: Number of large positive associations for 2000-examinee 2PL simulation ($J$ = test
length).


  J     i     j     M-H z    Nominal p    $\hat\alpha_{MH}$    $\ln(\hat\alpha_{MH})$
  10    1     1     1.683      0.046           2.981                 1.092
  20    18    4     1.460      0.072           2.227                 0.801

Table 4: Details for 10- and 20-item 2PL simulations ($\hat\alpha_{MH}$ = est. common odds ratio across
$\bar X_J$ categories).


  J     # M-H tests    # M-H z's above 1.28
  10         45                  0
  20        190                  4
  40        780                  0
  80       3160                  0

Table 5: Number of large positive associations for 2000-examinee 3PL simulation ($J$ = test
length).


i 

a 

M-H  2 

Nominal  p 

OMH 

In(dAfH) 

3 

1 

mgmm 

4 

1 

■EH 

1.654 

■i9 

4 

3 

1.777 

■EH 

1.300 

20 

1 

1.568 

1.411 

0.344 

Table 6: Details for 20-item 3PL simulation ($\hat\alpha_{MH}$ = est. common odds ratio across $\bar X_J$
categories).




References 

Ash,  R.  B.  (1972).  Real  Analysis  and  Probability.  Academic  Press.  New  York. 

Birnbaum, A. (1968). Some latent trait models and their use in inferring an examinee's
ability, in Lord, F. M. and Novick, M. R. (1968) Statistical Theories of Mental Test
Scores. Addison-Wesley. Reading, Massachusetts.

Clarke,  B.  S.  (1990).  Private  communication. 

Clarke, B. S. and Junker, B. W. (1990). Inference from the product of marginals of a
dependent likelihood. Working paper.

Cressie,  N.  and  Holland,  P.  W.  (1983).  Characterizing  the  manifest  probabilities  of  latent 
trait  models.  Psychometrika,  48,  129-141. 

Esary, J. D., Proschan, F. and Walkup, D. W. (1967). Association of random variables, with
applications. Annals of Mathematical Statistics, 38, 1466-1474.

Fienberg, S. E. (1986). The Rasch Model, in Kotz, S., Johnson, N. L. and Read, C. B., eds.,
Encyclopedia of Statistical Sciences, 7, 627-632. John Wiley and Sons, Inc. New York.

Grayson, D. A. (1988). Two-group classification in latent trait theory: scores with monotone
likelihood ratio. Psychometrika, 53, 383-392.

Holland,  P.  W.  (1981).  When  are  item  response  models  consistent  with  observed  data? 
Psychometrika,  46,  79-92. 

Holland, P. W. and Rosenbaum, P. R. (1986). Conditional association and unidimensionality
in monotone latent trait models. Annals of Statistics, 14, 1523-1543.

Jannarone, R. J. (1986). Conjunctive item response theory kernels. Psychometrika, 51,
357-373.

Joag-Dev, K. (1983). Independence via uncorrelatedness under certain dependence
structures. Annals of Probability, 11, 1037-1041.




Joag-Dev, K. and Proschan, F. (1982). Negative association of random variables, with
applications. Annals of Statistics, 10, 286-295.

Junker,  B.  W.  (1988).  Statistical  aspects  of  a  new  latent  trait  theory.  Ph.D.  dissertation. 
Department  of  Statistics,  University  of  Illinois  at  Urbana-Champaign. 

Junker, B. W. (1991). Essential independence and likelihood-based ability estimation for
polytomous items. To appear, Psychometrika.

Lehmann,  E.  L.  (1955).  Ordered  families  of  distributions.  Annals  of  Mathematical  Statistics, 
26,  399-419. 

Levine, M. V. (1985). Representing ability distributions. Office of Naval Research, Research
Report 85-1. Model-based Measurement Laboratory, Department of Educational
Psychology, University of Illinois at Urbana-Champaign.

Lord, F. M. (1980). Applications of Item Response Theory to Practical Testing Problems.
Lawrence Erlbaum Associates, Inc. Hillsdale, N. J.

Molenaar, I. (1990). Private communication.

Nandakumar, R. (1987). Refinement of Stout's procedure for assessing latent trait
unidimensionality. Ph.D. dissertation. Department of Education, University of Illinois at
Urbana-Champaign.

Nandakumar, R. (1989). Validation of Stout's procedure for assessing latent trait
unidimensionality with real tests. Paper presented at the 1989 European Meeting of the
Psychometric Society, Leuven, Belgium.

Nandakumar, R. and Stout, W. F. (1990). Refinement of a procedure for assessing latent
trait unidimensionality. Submitted.

Newman, C. M. and Wright, A. L. (1981). An invariance principle for certain dependent
sequences. Annals of Probability, 9, 671-675.

Rasch,  G.  (1980).  Probabilistic  models  for  some  intelligence  and  attainment  tests.  (Expanded 
edition,  1980.)  University  of  Chicago  Press.  Chicago,  Illinois. 




Rosenbaum, P. R. (1984). Testing the conditional independence and monotonicity
assumptions of Item Response Theory. Psychometrika, 49, 425-436.

Rosenbaum, P. R. (1985). Comparing distributions of item responses for two groups. British
Journal of Mathematical and Statistical Psychology, 38, 206-215.

Rosenbaum, P. R. (1987). Probability inequalities for latent scales. British Journal of
Mathematical and Statistical Psychology, 40, 157-168.

Rosenbaum,  P.  R.  (1988).  Item  bundles.  Psychometrika,  53,  349-359. 

Stout,  W.  F.  (1987).  A  nonparametric  approach  for  assessing  latent  trait  unidimensionality. 
Psychometrika,  52,  589-617. 

Stout, W. F. (1988). A nonparametric multidimensional IRT approach with applications
to ability estimation and test bias. Office of Naval Research, Research Report 88-1.
Department of Statistics, University of Illinois at Urbana-Champaign.

Stout, W. F. (1990). A new item response theory modeling approach with applications to
unidimensionality assessment and ability estimation. Psychometrika, 55, 293-325.

Suppes, P. and Zanotti, M. (1981). When are probabilistic explanations possible? Synthese,
48, 191-199.

Sympson, J. B. (1987). Applications of a polychotomous IRT model to adaptive mental
testing. Paper presented at the Office of Naval Research Contractors' Meeting on
Model-based Psychological Measurement, June 1987, University of South Carolina, Columbia,
South Carolina.

Zwick, R. (1987). Assessing the dimensionality of NAEP reading data. Journal of Educational
Measurement, 24, 293-308.






